I know there are already objects supporting Office 2007 files, but is there any native support for Office 2003 or earlier?
2 Answers
1
There doesn't seem to be anything bundled with Zend_Search_Lucene for those.
Still, since it can index HTML documents, if you can find a way to convert your Office 2003 documents to HTML (at least for indexing, keeping the original version alongside the HTML one for consultation), you might be able to index those.
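As a rough sketch of that idea, assuming Zend Framework 1 is on the include path and the HTML copies already exist (the conversion step itself is not shown), something like the following could index each HTML copy while remembering where the original Office file lives. The function name `buildIndex`, the field name `original_path`, and all paths are hypothetical.

```php
<?php
// Assumption: Zend Framework 1 is installed and on the include path.
require_once 'Zend/Search/Lucene.php';
require_once 'Zend/Search/Lucene/Document/Html.php';

/**
 * Build a new index from HTML copies of Office documents.
 *
 * @param string $indexDir Directory where the index is created.
 * @param array  $pairs    Map of HTML copy path => original Office file path.
 * @return Zend_Search_Lucene_Interface
 */
function buildIndex($indexDir, array $pairs)
{
    $index = Zend_Search_Lucene::create($indexDir);

    foreach ($pairs as $htmlPath => $originalPath) {
        // Parse the HTML copy; its title and body become searchable fields.
        $doc = Zend_Search_Lucene_Document_Html::loadHTMLFile($htmlPath);

        // Store (without indexing) the path of the original document,
        // so a search hit can point back to the .doc/.xls file.
        $doc->addField(
            Zend_Search_Lucene_Field::unIndexed('original_path', $originalPath)
        );

        $index->addDocument($doc);
    }

    $index->commit();
    return $index;
}
```

A search would then go through `$index->find('some terms')`, and each hit's `original_path` field gives the file to offer for download.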

Pascal MARTIN
0
I would recommend indexing the documents with Solr and Tika together and using JSON to search your Solr/Lucene index from PHP. See the ExtractingRequestHandler (Solr wiki page) article for more information.
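To give an idea of what that could look like from PHP, here is a minimal sketch using only built-in functions plus the curl extension. It assumes a Solr core with the ExtractingRequestHandler mapped at `/update/extract` and `id` as the unique key; the base URL, core name, and function names are placeholders of mine, not anything Solr prescribes.

```php
<?php
// Hypothetical Solr core URL; adjust host, port, and core name.
const SOLR_BASE = 'http://localhost:8983/solr/documents';

// URL for sending a file to the ExtractingRequestHandler (Tika does the parsing).
function solrExtractUrl($base, $docId)
{
    $params = array(
        'literal.id' => $docId,  // value for the unique key field
        'commit'     => 'true',  // make the document searchable right away
    );
    return $base . '/update/extract?' . http_build_query($params, '', '&');
}

// URL for a search that returns JSON.
function solrSelectUrl($base, $query, $rows = 10)
{
    $params = array('q' => $query, 'wt' => 'json', 'rows' => $rows);
    return $base . '/select?' . http_build_query($params, '', '&');
}

// Pull the list of matching documents out of a Solr JSON response body.
function solrDocs($json)
{
    $data = json_decode($json, true);
    return isset($data['response']['docs']) ? $data['response']['docs'] : array();
}

// Upload one Office file as a multipart POST (requires ext-curl, PHP 5.5+).
function solrIndexFile($base, $docId, $path)
{
    $ch = curl_init(solrExtractUrl($base, $docId));
    curl_setopt($ch, CURLOPT_POST, true);
    curl_setopt($ch, CURLOPT_POSTFIELDS, array('file' => new CURLFile($path)));
    curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
    $body = curl_exec($ch);
    curl_close($ch);
    return $body;
}
```

Searching is then a matter of fetching `solrSelectUrl(SOLR_BASE, 'annual report')` with `file_get_contents()` or curl and passing the body to `solrDocs()`.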

Brian