Need to extract text line by line from PDF using itextsharp and put enter at every end of line

Question

I got this function of itextsharp library to extract pdf text line by line:

PdfTextExtractor.GetTextFromPage(reader, page);

...but I need to put ENTER at every line every end of line of pdf even if there is empty row it should read empty row.

`PdfTextExtractor.GetTextFromPage` **does** put end-of-line markers at the end of every line it recognizes (cf. the method `GetResultantText` of the `LocationTextExtractionStrategy`: `sb.Append('\n');`). That being said there generally *is no **end of line** or **row** in a PDF!* Therefore, if iText's heuristics for *interpreting such concepts into the PDF page content* don't work for you, you may need a custom `TextExtractionStrategy` implementation. If you need help with that, please give more details, especially what you get, what you want, and a sample PDF illustrating your issue. — mkl, May 06 '13 at 08:59
+1 for @mkl: There is no such thing as 'a line' in a PDF, nor is there such a thing as 'ENTER'. Content is added at absolute positions; it isn't organized in lines. — Bruno Lowagie, May 06 '13 at 10:18

score 4 · Answer 1 · edited Jun 27 '18 at 08:50

4

read into a string variable then split e.g. String page = PdfTextExtractor.getTextFromPage(reader, 2);

String[] s1 = page.split('\n');

edited Jun 27 '18 at 08:50

Xavier Stévenne

answered May 09 '13 at 12:48

adebayo

score 0 · Answer 2 · edited May 23 '17 at 12:02

0

Please go through the following Links:

edited May 23 '17 at 12:02

Community

answered May 06 '13 at 06:10

Vaibhav Jain

1

Welcome to Stack Overflow! Whilst this may theoretically answer the question, [it would be preferable](http://meta.stackexchange.com/q/8259) to include the essential parts of the answer here, and provide the link for reference. – JJJ May 06 '13 at 06:12
Concerning the stack overflow link: Please make clear that you indeed want to refer to the answers making use of the `PdfTextExtractor` class. – mkl May 06 '13 at 08:47

2 Answers2