2

What the best way to index pdf documents? Should I index them by converting pdf documents to txt or there is a better way to index pdf files?

javanna
  • 59,145
  • 14
  • 144
  • 125
Ahsan Iqbal
  • 1,422
  • 5
  • 20
  • 39

1 Answers1

3

Assuming you're talking about solr: see the ExtractingRequestHandler.

msbmsb
  • 1,406
  • 9
  • 9