I need to index a long list of documents (mostly ms office formats, pdf) and perform full text search and support versioning.
I read about lucene but it seems far to be a complete solution, does anyone know a commercial complete indexer?
I need to index a long list of documents (mostly ms office formats, pdf) and perform full text search and support versioning.
I read about lucene but it seems far to be a complete solution, does anyone know a commercial complete indexer?
For versioning use git or mercurial.
For the "full text search" I found some links:
You can try Recognition Server, it's high-volume OCR, document conversion and indexing software. http://www.abbyy.com/recognition_server/
This software creates searchable digital archives. You can download trial version and try it for free