
Tika can take a URL parameter to extract remote files directly. Alternatively, we can download the remote file first and then let Tika extract it like a local file. In terms of performance and correctness, which approach is the better choice? Thanks.

javanna
internal

1 Answer


It would be faster to extract directly from the URL, since that avoids the extra read/write on your local system. That's what I do when parsing files with Tika. Fast and straightforward.
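To make the two options concrete, here is a minimal sketch using only the JDK. The class `RemoteExtract` and the `Extractor` interface are hypothetical names invented for this illustration; `Extractor` is a stand-in for whatever Tika call you use on an `InputStream` (for example, something along the lines of `new Tika().parseToString(stream)`, which would need its checked `TikaException` wrapped to fit this signature).

```java
import java.io.IOException;
import java.io.InputStream;
import java.net.URL;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;

public class RemoteExtract {

    /** Hypothetical stand-in for a Tika call that turns a stream into text. */
    @FunctionalInterface
    public interface Extractor {
        String extract(InputStream in) throws IOException;
    }

    /** Option 1: hand the URL's stream straight to the extractor, no local copy. */
    public static String extractFromUrl(URL url, Extractor extractor) throws IOException {
        try (InputStream in = url.openStream()) {
            return extractor.extract(in);
        }
    }

    /** Option 2: download to a temporary file first, then extract from the local copy. */
    public static String downloadThenExtract(URL url, Extractor extractor) throws IOException {
        Path tmp = Files.createTempFile("tika-download-", ".bin");
        try {
            // Extra disk write: copy the remote bytes to the temp file.
            try (InputStream in = url.openStream()) {
                Files.copy(in, tmp, StandardCopyOption.REPLACE_EXISTING);
            }
            // Extra disk read: extract from the local copy.
            try (InputStream local = Files.newInputStream(tmp)) {
                return extractor.extract(local);
            }
        } finally {
            // Clean up the temp file whether or not extraction succeeded.
            Files.deleteIfExists(tmp);
        }
    }
}
```

Both methods should return the same text for the same bytes; the difference is the extra write and read in the second one. The local copy may still be worth it if you want to retry or re-parse without requesting the file again.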

Bacaa14