Questions tagged [lucene]

The term Lucene refers to the open source Java fulltext search engine library, but also to the entire eco-system that grew around it, including lucene.net, solr, elasticsearch and zend-search-lucene.

The term "Lucene" refers to the open source Java fulltext search engine library, and also to the entire eco-system that grew around it, including , , and . "Lucene" may also be used to refer to top-level projects like Nutch and Tika which were once sub-projects of Lucene.

Use the "Lucene" tag if either:

  • The question is about the Java library
  • The question is about a port of the library, but would make sense to people who know the Java library (many Lucene.NET questions match this criteria).
  • The question is so general it doesn't apply to a specific implementation (example).

References:

Basic Demo:

A basic "getting started" demo showing how to build and query an index is provided as part of the official documentation:

Basic Demo documentation - (this link is for Lucene v8.7.0. Newer versions may be available)

Links to the demo's source files are provided in the above documentation.

The source code can also be found here on GitHub.

Luke - a Lucene GUI Client:

Luke is a GUI client application which can be used to explore your Lucene indexes. Recent versions of Luke are now provided as part of each binary release, which can be downloaded from here.

After downloading the binary release, unzip it, and go to the luke directory. Launch the client using the provided luke.bat or luke.sh scripts.

11993 questions
3
votes
2 answers

Solr/Lucene spellcheck suggestions based on multiple fields

I have a database with Vendor's information: name and address (address, city, zip and country fields). I need to search this database and return some vendors. On the search box, the user could type anything: name of the vendor, part of the address,…
nepomucenobr
  • 278
  • 1
  • 3
  • 12
3
votes
2 answers

How to add special characters in lucene search ? c#

I am using Standard analyzer from lucene in my search engine to search for german words this is my code: private IList GetFromLucene(string terme, string FieldName) { TopDocs hits; CustomAnalyzer standardAnalyzer = new…
Youssef Boudaya
  • 123
  • 1
  • 2
  • 15
3
votes
0 answers

Is there a known implementation that enables mongo to work with Apache Lucene (like mongo does with Atlas search)?

I was wondering if there is some package/docker that gives the functionality that mongo gives with Atlas search using Apache lucene behind the scene. Practically, I want to have a small self-hosted docker (can't work with Atals) with mongo and…
yoka791
  • 576
  • 5
  • 17
3
votes
2 answers

Solr terms component over multiple fields?

I am able to receive the most frequently used terms in my index via the terms compontent described here: http://wiki.apache.org/solr/TermsComponent However this only seems to work for exactly one field. I would really like to have this…
The Surrican
  • 29,118
  • 24
  • 122
  • 168
3
votes
3 answers

Is Solr a Good Solution for the Problem Described Below?

I am volunteering for a non-profit and the CEO would like an application that stores resumes of University professors. The resumes are to be searchable so that possible employers can find them. The resumes could be in a variety of formats, including…
Sam
  • 81
  • 2
  • 7
3
votes
1 answer

In Solr, why is 'built' not being stemmed to 'build' but 'building' is?

I'm trying to figure out two things in this posting: Why is 'built' NOT being stemmed to 'build' even though the field type definition has a stemmer defined. However, 'building' is being stemmed to 'build' How to use Luke to examine the index to…
jabawaba
  • 279
  • 1
  • 6
  • 16
3
votes
0 answers

Difference between storing position and offset information in term vector vs index options in Lucene

In my understanding in Lucene while creating a Field we can specify and IndexableFieldType. FieldType is the concrete implementation of IndexableFieldType. Using FieldType we can control among other things: Index Options: These help us control…
Ishan
  • 154
  • 1
  • 1
  • 9
3
votes
1 answer

Where can I get XSD schema for solrconfig.xml and schema.xml

I want the XML schemas defining what elements can appear where in solrconfig and schema XML files, for some IDE completion help, and also to handwrite some config, instead of copy pasting from the net where there is some mixed content for many solr…
Zasz
  • 12,330
  • 9
  • 43
  • 63
3
votes
0 answers

How to incorporate Queried text's term frequencies in Elasticsearch document scoring

TF-IDF or BM25 works by extracting terms from queried text then Scoring the matching documents based on the presence of those terms and takes into account the length of the field. I want queried text's term frequencies to influence the document…
varshneyanmol
  • 247
  • 1
  • 5
3
votes
3 answers

Search multiple indices at once using Lucene Search

I am using Zend_Search_Lucene to implement site search. I created separate indices for different data types (e.g. one for users, one for posts etc). The results are similarly divided by data type however there is an 'all' option which should show a…
Akeem
  • 7,897
  • 5
  • 32
  • 41
3
votes
2 answers

Why do I get "ManagedIndexSchema Error persisting managed schema => FileNotFoundException: (Access is denied)" when adding a field to a Solr core?

I have a java program running on windows that: Run Solr Server using "bin/solr.cmd restart -f -q -p 55183". Delete (if exists) the old core named "my_core" using "bin/solr.cmd delete -c my_core". Create a new core named "my_core" using…
David
  • 59
  • 3
3
votes
2 answers

store documents based on sort order in lucene index

I have two field (name, modifiedDate) in my index. i want to store new document based on modifiedDate and keep index sorted on modifiedDate doc #1 is the oldest document and (modifiedDate) is oldest too doc #n is most recent document and…
Ehsan
  • 1,662
  • 6
  • 28
  • 49
3
votes
2 answers

Running Lire image search inside Solr -- how?

I'd like to use Lire image search from within Solr. Lire is built on top of Lucene: http://www.semanticmetadata.net/lire/ What's the best way to integrate Lire in Solr? I'd especially appreciate links to any success stories or sample code in this…
fish2000
  • 4,289
  • 2
  • 37
  • 76
3
votes
0 answers

MultiField.getFields equivalent in Lucene 8?

I am using Lucene 8.10.1. I have the following code : IndexReader reader = DirectoryReader.open(FSDirectory.open(Paths.get(index))); try { Fields fields = MultiFields.getFields(reader); for (String termfield : fields) { Terms…
Kiera.K
  • 317
  • 1
  • 13
3
votes
1 answer

zend search lucene query api: boosting a term

I am using zend search lucene, and would like to add boosts to some of my search terms. The code is already written using the query construction API as follows. $query->addTerm(new Zend_Search_Lucene_Index_Term($name,'name'), null); I tried…
1 2 3
99
100