Parse PDF with ABCPDF

Question

I download a PDF document and load it with ABCPDF, but I can't find any elements in the document, or any way to reach them and iterate over them. I want to extract some text from it.

var webClient = new WebClient();
var bytes = webClient.DownloadData("http://test.com/test.pdf");

var doc = new Doc();
doc.Read(bytes);

score 2 · Answer 1 · answered Feb 15 '13 at 09:30

Use the Doc.GetText method to extract content from the current page, specifying the format in which content is to be returned.

doc.PageNumber = 1;
string pageContent = doc.GetText("Text");

The example above will return plain text in layout order. Specifying "SVG" or "SVG+" returns additional information along with the text, such as style and position.
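Since GetText works on the current page only, extracting text from the whole document means stepping PageNumber through every page. Below is a minimal sketch of that loop, built on the code from the question. The namespace WebSupergoo.ABCpdf9 is an assumption and depends on the ABCpdf version you have installed, and the class name PdfTextDump is just illustrative.

```csharp
using System;
using System.Net;
using System.Text;
using WebSupergoo.ABCpdf9; // assumption: adjust to match your installed ABCpdf version

class PdfTextDump // hypothetical name, for illustration only
{
    static void Main()
    {
        // Download the PDF into memory, as in the question.
        byte[] bytes;
        using (var webClient = new WebClient())
        {
            bytes = webClient.DownloadData("http://test.com/test.pdf");
        }

        var text = new StringBuilder();
        using (var doc = new Doc())
        {
            doc.Read(bytes);

            // Pages are numbered from 1; GetText reads the current page.
            for (int page = 1; page <= doc.PageCount; page++)
            {
                doc.PageNumber = page;
                text.AppendLine(doc.GetText("Text"));
            }
        }

        Console.WriteLine(text.ToString());
    }
}
```

From there you can search the collected string with ordinary string or regular expression methods. If you need positions or styles rather than plain text, swapping "Text" for "SVG" in the same loop should work the same way.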
