Extract Full Paragraphs from Plain Text Documents

nateTheNotSoGreat · July 27, 2023, 4:42am

I have unformatted text documents that contain semi standard sentences/paragraphs that I would like to pull out so that I can format the document with the correct information in the correct place. I am thinking I would need to do some sort of attention layer to return the start and end tokens but can’t find any examples.

Think: I have an empty document template with multiple sections that need to be filled in, and an unformatted document with all of the information in text form. How would you split up the text file so that all of the information can be placed in to the template.

Topic		Replies	Views
Extract data/snippets from text General Discussion nlp , learning , help_request	6	2927	July 27, 2023
Classify words by meaning in booking documents General Discussion models	1	783	February 6, 2024
Review of legal documents in PDF - suggestion needed General Discussion help_request	0	279	March 6, 2023
How to get started analyzing html documents Keras keras , ml	1	79	October 15, 2025
Detecting units of measure in text? General Discussion models , help_request	1	653	February 1, 2024

Extract Full Paragraphs from Plain Text Documents

Related topics