3

I need to write a program that can find the largest text in a PDF document (which is probably the title!), I've tried using Tika as a parser but I couldn't find a way to get the font size. Also the Tika library seems too powerful and complex for my use. Is there an easier way/library that I can use? Is it easier or better to not use a library and parse the PDF myself (I am kinda interested in this approach)? If so, where should I start?

Any information is greatly appreciated.

Auxiliary
  • 2,687
  • 5
  • 37
  • 59

1 Answers1

1

I found this method on SO that overrides the processTextPosition from PDFTextStripper in the PDFBox library and extracts the font size. I didn't see it before because the post's question was a bit different.

Community
  • 1
  • 1
Auxiliary
  • 2,687
  • 5
  • 37
  • 59