I have PDF, Word, and HTML documents that average around 700-800 pages each. I want to build a Rails search application using Solr/Lucene and the Sunspot gem. How should I manage storage for these documents? Should I parse each whole document and store its text in the database, or search the documents directly? Which option scales and performs better? Assume I have around 1-2 lakh (100,000-200,000) documents.

Data Don
I think these links cover a similar problem: http://stackoverflow.com/questions/7739193/searching-attachments-from-a-rails-app-word-pdf-excel-etc and http://rny.io/rails/elasticsearch/2013/08/05/full-text-search-for-attachments-with-rails-and-elasticsearch.html – Vishal Zambre Nov 11 '14 at 05:03

0 Answers