Questions tagged [lucene]

The term Lucene refers to the open source Java fulltext search engine library, but also to the entire eco-system that grew around it, including lucene.net, solr, elasticsearch and zend-search-lucene.

The term "Lucene" refers to the open source Java fulltext search engine library, and also to the entire eco-system that grew around it, including , , and . "Lucene" may also be used to refer to top-level projects like Nutch and Tika which were once sub-projects of Lucene.

Use the "Lucene" tag if either:

  • The question is about the Java library
  • The question is about a port of the library, but would make sense to people who know the Java library (many Lucene.NET questions match this criteria).
  • The question is so general it doesn't apply to a specific implementation (example).

References:

Basic Demo:

A basic "getting started" demo showing how to build and query an index is provided as part of the official documentation:

Basic Demo documentation - (this link is for Lucene v8.7.0. Newer versions may be available)

Links to the demo's source files are provided in the above documentation.

The source code can also be found here on GitHub.

Luke - a Lucene GUI Client:

Luke is a GUI client application which can be used to explore your Lucene indexes. Recent versions of Luke are now provided as part of each binary release, which can be downloaded from here.

After downloading the binary release, unzip it, and go to the luke directory. Launch the client using the provided luke.bat or luke.sh scripts.

11993 questions
3
votes
2 answers

Teracotta and Hibernate Search

Does anyone have experience with using Terracotta with Hibernate Search to satisfy application Queries? If so: What magnitude of "object updates" can it handle? (How's the performance) What kind of performance do the Queries have? Is it possible…
mainstringargs
  • 13,563
  • 35
  • 109
  • 174
3
votes
1 answer

InnoDB Full Text Searches

I have never used full text searching and in starting to learn how to do so am coming across all sorts of pages and articles stating the facts about how full text searching is only available on myisam and what a bad choice that is etc etc. I've also…
ryandlf
  • 27,155
  • 37
  • 106
  • 162
3
votes
1 answer

Lucene: Non-Range queries on Numeric field

We are currently using lucene 2.3.2. We pad integers with leading zeroes and index so that we can support range queries as well. We are working towards upgrading to lucene 3.4 and want to use NumericRangeQueries. We also want to support non-range…
naresh
  • 2,113
  • 20
  • 32
3
votes
1 answer

How to index/query nested documents in lucene 3.4?

I'm looking for a way to understand how the newly implemented nested document feature works in lucene 3.4. There doesn't seem to be any tutorial available on the net yet considering that it's a very recent feature. Can somebody please demonstrate on…
ananthv
  • 533
  • 1
  • 5
  • 12
3
votes
1 answer

BlockJoin Query in lucene

Can anyone elaborate about BlockJoinQuery in lucene with a simple scenario and source code? I couldnt find any fruitful links from Google for BlockJoinQuery. Any links which will detail about this will be useful for me. Thanks in advance
Jagadesh
  • 6,489
  • 8
  • 29
  • 30
3
votes
4 answers

Is there any Lucene wrapper in C / Objective-C?

I know there is the CLucene project, which is a port of Lucene from Java to C. But is there a Lucene wrapper in C/Objective-C similar to PyLucene that uses JNI and and embeds a JavaVM with Lucene in a separate process?
Tony
  • 36,591
  • 10
  • 48
  • 83
3
votes
1 answer

Lucene - few or a lot of indexes

Is it better to use a lot of indexes (eg. for every user as your application allows that) in Lucene or just one, having every document in int ... if you think about: performance disk space health I am using elasticsearch, therefore I am using…
maerzbow
  • 185
  • 1
  • 2
  • 9
3
votes
1 answer

Solr/Lucene Multivalued Fields - maximum number of values?

Is there an actual or practical limit to the number of values that can be placed in a multivalued field in Solr/Lucene? For example, if I want to have an indexed field with 5000 numeric values, will this cause problems? Thanks.
Frank LaRosa
  • 3,533
  • 6
  • 26
  • 32
3
votes
2 answers

Nested prohibit/require operators in Lucene search queries

I am using Lucene for Java, and need to figure out what the engine does when I execute some obscure queries. Take the following query: +(foo -bar) If I use QueryParser to parse the input, I get a BooleanQuery object that looks like…
ysim
3
votes
1 answer

Solr core's numDocs does not equal sum of processed documents

I have an issue while building my Solr index (Lucene & Solr 3.4.0 on an Apache Tomcat 6.0.33). The data for the documents to index comes out of an Oracle database. Since I have to handle loads of CLOBs, I splitted up the dataimport into several…
tobwoerk
  • 118
  • 7
3
votes
2 answers

Lucene in Java, C#.Net and C++. Which is the best version for long-term use on Windows server?

I am going to implement Lucene search into my project and I want to make a best start. So I consider between 3 versions of Lucene (Java/C#.Net/C++) which is the best version upon these criterias : 1.performance 2.easy to implement 3.plenty of…
JatSing
  • 4,857
  • 16
  • 55
  • 65
3
votes
0 answers

Solr 4 term list differs from analysis result

I was trying to use the MoreLikeThisComponent to find similar documents. With one of the results I was wondering about the terms used by MLT, since the "interestingTerms" contained terms that were not part of the text analysis results. Here is what…
Jan Rasehorn
  • 311
  • 2
  • 6
3
votes
1 answer

Using stop words with WhitespaceAnalyzer

Lucene's StandardAnalyzer removes dots from string/acronyms when indexing it. I want Lucene to retain dots and hence I'm using WhitespaceAnalyzer class. I can give my list of stop words to StandardAnalyzer...but how do i give it to …
Steve Chapman
  • 1,317
  • 4
  • 23
  • 34
3
votes
2 answers

How to do a full text search in Cocoa?

I need something like Lucene to do an optimized full text search in Cocoa. I am working on an Iphone app to search through a database. Anybody has any luck with other databases. Any help is appreciated. So far, I can only find…
Srikar Doddi
  • 15,499
  • 15
  • 65
  • 106
3
votes
3 answers

Impact of repeat value across multiple fields in Lucene

What would be the impact of re-indexing the same value across multiple fields in a lucene index? The idea is that someone's first name is a part of their name and their general details. So I would want to index that value into multiple fields. Ted…
Edd
  • 8,402
  • 14
  • 47
  • 73