Questions tagged [lucene]

The term Lucene refers to the open source Java fulltext search engine library, but also to the entire eco-system that grew around it, including lucene.net, solr, elasticsearch and zend-search-lucene.

The term "Lucene" refers to the open source Java fulltext search engine library, and also to the entire eco-system that grew around it, including , , and . "Lucene" may also be used to refer to top-level projects like Nutch and Tika which were once sub-projects of Lucene.

Use the "Lucene" tag if either:

  • The question is about the Java library
  • The question is about a port of the library, but would make sense to people who know the Java library (many Lucene.NET questions match this criteria).
  • The question is so general it doesn't apply to a specific implementation (example).

References:

Basic Demo:

A basic "getting started" demo showing how to build and query an index is provided as part of the official documentation:

Basic Demo documentation - (this link is for Lucene v8.7.0. Newer versions may be available)

Links to the demo's source files are provided in the above documentation.

The source code can also be found here on GitHub.

Luke - a Lucene GUI Client:

Luke is a GUI client application which can be used to explore your Lucene indexes. Recent versions of Luke are now provided as part of each binary release, which can be downloaded from here.

After downloading the binary release, unzip it, and go to the luke directory. Launch the client using the provided luke.bat or luke.sh scripts.

11993 questions
3
votes
1 answer

Using Lucene's API, can I boost terms?

With Lucene's query parser, it's possible to boost terms (causing them to weight higher in the search results) by appending "^n", e.g. "apple^5 pie" would assign five times more importance to the term "apple". Is it possible to do this when…
Frungi
  • 506
  • 5
  • 16
3
votes
2 answers

Sitecore Lucene Search missing file inside Data\indexes\__system folder

We are using Sitecore(Version 6.4) Lucene Search. When i am trying to search any keyword, it is throwing file not found exception. Lucene is looking for file _3zj.fnm inside Data\indexes__system\ folder. I tried to rebuild search indexes from…
Neil
  • 435
  • 3
  • 7
  • 19
3
votes
1 answer

Best practices for syncing Lucene repository with source data?

I am designing an application which will have a heavy reliance on searching using a Lucene.NET repository. The repository will be built using data from an operational database that is constantly changing. I'm trying to figure out the best strategy…
RationalGeek
  • 9,425
  • 11
  • 62
  • 90
3
votes
1 answer

Searching in Liferay document library by document extension

I was developing a module for advanced search over liferay content, and I was stuck in document library search, it was asked to search in documents by document type, and I wish to know the feasibility of such feature, I've check liferay code and…
3
votes
3 answers

Is there a fast, accurate Highlighter for Lucene?

I've been using the (Java) Highlighter for Lucene (in the Sandbox package) for some time. However, this isn't really very accurate when it comes to matching the correct terms in search results - it works well for simple queries, for example…
Mat Mannion
  • 3,315
  • 2
  • 30
  • 31
3
votes
3 answers

Where is the default path to the apache lucene index file with grails searchable plugin?

I'm using in my grails 1.3.7 project the searchable plugin. My project was running well until my macbooks harddrive was full and osx threw an message about memory problems and that it'll delete some files (tmp and so on) (after that i deleted some…
whitenexx
  • 1,350
  • 2
  • 25
  • 53
3
votes
1 answer

Storing multiple values in single property of Neo4j Lucene index

I want to store multiple values in a single index property of neo4j Lucene index, e.g. IndexName: profile property- Education: "Stanford University, Grad School", "Harvard University, MS" property- Work: "Nokia Siemens Networks", "Motorola"…
wakeup
  • 495
  • 4
  • 19
3
votes
1 answer

Updates to SolrConfig.xml file are not being reflected

I added in a new request handler in the SolrConfig.xml file, however anytime I try to use the new handler I get back a 404 with "unknown handler" error. Are there any other files to modify when the SolrConfig.xml file is modified? Thanks
aster
  • 146
  • 2
  • 10
3
votes
3 answers

Improving lucene spellcheck

I have a lucene index, the documents are in around 20 different languages, and all are in the same index, I have a field 'lng' which I use to filter the results in only one language. Based on this index I implemented spell-checker, the issue is…
zebra
  • 1,330
  • 1
  • 13
  • 26
3
votes
5 answers

What's your strategy for collecting Solr metrics?

Is there any open source tools for this? Something with pretty graphs maybe? Or is all we have in /admin/stats.jsp? I am looking for monitoring tools, search metrics mining, and log analysis. Thanks!
jprado
  • 308
  • 2
  • 12
3
votes
1 answer

Lucene experts: how best to run diagnostics against an IndexWriter to resolve performance issues?

I've got an index that currently occupies about 1gb of space and has about 2.5 million documents. The index is stored on a solid-state drive for speed. I'm adding 2500 documents at a time and committing after each batch has been added. The index is…
Nathan Ridley
  • 33,766
  • 35
  • 123
  • 197
3
votes
2 answers

How do I pass a list of 'allowed' IDs to filter a Lucene search?

I need to return just the documents that a user has access to from a Lucene search. I can get a list of IDs from a database that make up the 'allowed' subset. How can I pass these to Lucene? The articles I've found on the web suggest I need to use a…
Nick
  • 5,616
  • 10
  • 52
  • 72
3
votes
1 answer

Zend Lucene with symfony and i18n

I've went through the Jobeet Tutorial for integrating Zend Lucene into a symfony (1.4.8) project in order to add search capabilities into my frontend of my site (through indexing). Among others, the key concept is to use updateLuceneIndex during…
pankar
  • 1,623
  • 1
  • 11
  • 21
3
votes
2 answers

How to work around the hibernate search (lucene) criteria restriction plus getResultSize issue?

Hibernate search (lucene) has an issue: If you apply a criteria restriction to FullTextQuery and apply pagination restrictions, i.e. criteria.setMaxResults() and criteria.setFirstResult(), the fullTextQuery.getResultSize() returns a count which…
jaseFace
  • 1,415
  • 5
  • 22
  • 34
3
votes
1 answer

Hibernate search storing byte[] in index

I have following config in my POJO: @Field(index = Index.NO, store = Store.YES) private byte[] file; When I start my Apps container I get following exception: Caused by: org.hibernate.search.SearchException: Unable to guess FieldBridge for file …
MatBanik
  • 26,356
  • 39
  • 116
  • 178