Questions tagged [solr]

Apache Solr is an open source search server based on the Lucene Java search library.

Its major features include powerful full-text search, hit highlighting, faceted search, dynamic clustering, database integration, and rich document (e.g., Word, PDF) handling. Solr is highly scalable, providing distributed search and index replication, and it powers the search and navigation features of many of the world's largest internet sites.

Solr uses the Lucene Java search library at its core for full-text indexing and search, and has REST-like HTTP/XML and JSON APIs that make it easy to use from virtually any programming language. Solr's powerful external configuration allows it to be tailored to almost any type of application without Java coding, and it has an extensive plugin architecture when more advanced customization is required.
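As a rough illustration of the HTTP API described above, the sketch below only builds a URL for Solr's `/select` handler with the standard `q`, `fq`, `rows` and `wt` parameters; it does not send a request. The helper name `build_select_url`, the collection name `products`, the field `category`, and the host and port are all assumptions made for the example.

```python
from urllib.parse import urlencode


def build_select_url(base_url, collection, query, filters=(), rows=10):
    """Build a URL for Solr's /select handler (hypothetical helper; sends nothing)."""
    params = [("q", query), ("rows", rows), ("wt", "json")]
    # Each filter query is passed as its own fq parameter.
    params.extend(("fq", f) for f in filters)
    return f"{base_url.rstrip('/')}/{collection}/select?{urlencode(params)}"


# Assumed values: a local Solr instance and a collection named "products".
url = build_select_url(
    "http://localhost:8983/solr",
    "products",
    "*:*",
    filters=["category:shoes"],
)
print(url)
```

The resulting URL can be fetched with any HTTP client, which is why no Solr-specific library is needed for simple queries.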

Solr can be accessed via HTTP or via numerous client libraries.

20557 questions
34
votes
3 answers

Indexing .PDF, .XLS, .DOC, .PPT using Lucene.NET

I've heard of Lucene.Net and I've heard of Apache Tika. The question is - how do I index these documents using C# vs Java? I think the issue is that there is no .Net equivalent of Tika which extracts relevant text from these document types. UPDATE…
dana
  • 17,267
  • 6
  • 64
  • 88
34
votes
4 answers

Solr query (q) or filter query (fq)

I have a ~1 mil product document Solr index. I also have a whole bunch of UI filters such as categories, tabs, price ranges, sizes, colors, and some other filters. Is it the right way to have the q selecting everything (q=*:*) while all other…
Tommy Lord
  • 543
  • 1
  • 7
  • 13
33
votes
3 answers

Solr date field tdate vs date?

So I have a question about Solr's date field types which is pretty straightforward: what's the difference between a 'date' field and a 'tdate' one? The schema.xml claims that 'For faster range queries, consider the tdate type' and 'A Trie based…
Osvaldo Mercado
  • 960
  • 3
  • 13
  • 24
33
votes
7 answers

Rails app: Solr throwing RSolr::Error::Http - 404 Not Found when executing search

Very lost as I haven't made any changes to my search which has always been working but somehow I've got a break. Being relatively new to Rails, the error output isn't giving me enough detail to debug. Relevant code below - ask if you want to see…
justinraczak
  • 776
  • 2
  • 9
  • 24
32
votes
2 answers

Full text search: Whoosh Vs SOLR

I am working on a Django project, where I need to implement full text search. I have seen SOLR and found some good comments for the same. But as it's implemented in Java, it would need a Java environment to be installed on the system along with Python.…
Ankit Jaiswal
  • 22,859
  • 5
  • 41
  • 64
32
votes
4 answers

Search multiple SOLR cores and return one result set

We have several cores set up in SOLR and would like to search across these cores and return a single result set. A bit more background: We have a SOLR core that we index our internal blog engine. We also have a SOLR core that we index our CMS…
Rihan Meij
  • 1,759
  • 1
  • 14
  • 18
31
votes
6 answers

Boolean NOT in solr query

How do I pick/delete all the documents from Solr using the boolean NOT notion? i.e. How do I delete all the documents from Solr whose id does NOT start with A59?
Mohit Ranka
  • 4,193
  • 12
  • 41
  • 41
31
votes
3 answers

What is the advantage of storing schema in avro?

We need to serialize some data for putting into solr as well as hadoop. I am evaluating serialization tools for the same. The top two in my list are Gson and Avro. As far as I understand, Avro = Gson + Schema-In-JSON. If that is correct, I do not see…
user2250246
  • 3,807
  • 5
  • 43
  • 71
30
votes
8 answers

How to fix: Error CREATEing SolrCore 'gettingstarted': Unable to create core

I'm getting this error when I try to create a new core in solr. root@ubuntu:/opt/solr# bin/solr create -c gettingstarted -n data_driven_schema_configs Setup new core instance directory: /var/solr/data/gettingstarted Creating new core…
JackXandar
  • 503
  • 1
  • 5
  • 14
30
votes
3 answers

SOLR autoCommit vs autoSoftCommit

I'm very confused about autoCommit and autoSoftCommit. Here is what I understand: autoSoftCommit - after an autoSoftCommit, if the SOLR server goes down, the autoSoftCommit documents will be lost. autoCommit - does a hard commit to the disk and makes sure all the…
user794783
  • 3,619
  • 7
  • 36
  • 58
29
votes
8 answers

How to get all results from solr query?

I executed a query like "Address:Jack*". It shows numFound = 5214 and displays 100 documents on the results page (I changed the default number of displayed results from 10 to 100). How can I get all the documents?
SENTHIL SARAVANAN
  • 719
  • 1
  • 12
  • 28
28
votes
3 answers

Solr fetching date ranges

Well, I'm coding some methods for returning Solr docs that match a date range. The docs store date fields in ISO 8601 format. Any ideas? Thanks.
Lici
  • 998
  • 4
  • 13
  • 22
28
votes
6 answers

Solr vs Hibernate Search - Which to choose and When?

We are building an ecommerce application. We are using a Java stack with Hibernate and Spring Framework. As with all ecommerce applications, we need to build search capability into ours. So, we came across Hibernate Search and Apache Solr. Can someone…
Lucky Murari
  • 12,672
  • 5
  • 22
  • 43
28
votes
2 answers

Setup sunspot solr with rails in production environment

I have tried various links but I can't seem to find a good resource on creating a running solr instance that works with rails in production. I understand that you have to set up the solr server for production. I have tried the setup of solr with…
Lester Celestial
  • 1,454
  • 1
  • 16
  • 26
28
votes
4 answers

Can a raw Lucene index be loaded by Solr?

Some colleagues of mine have a large Java web app that uses a search system built with Lucene Java. What I'd like to do is have a nice HTTP-based API to access those existing search indexes. I've used Nutch before and really liked how simple the…
mw542