Questions tagged [pylucene]

PyLucene is a Python extension for accessing Java Lucene. Its goal is to allow you to use Lucene's text indexing and searching capabilities from Python.

PyLucene is a Python extension for accessing Java Lucene. Its goal is to allow you to use Lucene's text indexing and searching capabilities from Python.

PyLucene is not a Lucene port but a Python wrapper around Java Lucene. PyLucene embeds a Java VM with Lucene into a Python process.

128 questions
3
votes
1 answer

Finding a single fields terms with Lucene (PyLucene)

I'm fairly new to Lucene's Term Vectors - and want to make sure my term gathering is as efficient as it possibly can be. I'm getting the unique terms and then retrieving the docFreq() of the term to perform faceting. I'm gathering all documents…
Ben DeMott
  • 3,362
  • 1
  • 25
  • 35
3
votes
3 answers

How to build and install pylucene on ubuntu 20.04

I am trying to install Pylucene on my WSL Ubuntu 20.04 clean installation. I tried to follow tutorial on the official page but it looks outdated. So I was wondering if anyone here managed to make it work on Ubuntu 20.04 and python 3.8.2 The…
Joozty
  • 460
  • 2
  • 12
  • 40
3
votes
2 answers

jcc.initVM() doesn't return when mod_wsgi is configured as daemon mode

I am using mod-wsgi with django, and in django I use pylucene to do full text search. While mod-wsgi is configured to be embedded mode, there is no problem at all. But when mod-wsgi is configured to be daemon mode, the apache just gets stuck, and…
Ross
  • 2,033
  • 4
  • 19
  • 19
3
votes
1 answer

Looking for pylucene tutorial

I finally was able to install the latest version of pylucene, though I'm not able to locate tutorials or examples on how to do some basic funtionalities like indexing, specifying the query and retrieving documents.
Notra
  • 39
  • 1
  • 2
2
votes
1 answer

How to get a list of all tokens from Lucene 8.6.1 index?

I have looked at how to get a list of all tokens from Solr/Lucene index? but Lucene 8.6.1 doesn't seem to offer IndexReader.terms(). Has it been moved or replaced? Is there an easier way than this answer?
PSK
  • 347
  • 2
  • 13
2
votes
1 answer

Using Fuzzy and Prefix match with Span_multi in ElasticSearch

My use case is to have multiple phrases/words highlighted in a field that is retrieved based on fuzzy and prefix matching. For this I have currently used span_near to handle phrases and span_multi for allowing the use for fuzziness in my phrase…
2
votes
0 answers

Package python software with pylucene dependency

I'm working on a python project that needs pylucene(python wrapper for lucene, a java library for search-engines programming). I've created a Dockerfile that automatically downloads and compile pylucene; then also installs other needed pip…
2
votes
1 answer

Implementing custom Similarity in PyLucene

I want to write a custom Similarity class in PyLucene to implement my own retrieval model. Usually, using the java version of Lucene you have to extend the Similarity class and override the methods. For example: public class IDFSimilarity extends…
Salias
  • 480
  • 6
  • 19
2
votes
1 answer

JCC failed to link python3.5 for pylucene installation

I'm trying to install JCC, which is a prerequisite of pylucene. I'm working with linux-mint Serena. I want to install it with python3.5. After checking the setup.py file I run: python3.5 setup.py build Here is the error I get: x86_64-linux-gnu-g++…
user3091275
  • 1,013
  • 2
  • 11
  • 27
2
votes
1 answer

How to install PyLucene 6.0.2 in windows

I'm trying to create a simple application on the Information Retreival using python & PyLucene. Can you help me to install PyLucene 6.2.0 on windows , I already installed jcc , but i didn't know how to install PyLucene. I found this documentation…
Asma
  • 137
  • 3
  • 15
2
votes
1 answer

How to install Pylucene locally without root access on a server?

recently I try to install pylucene on a server in my own home directory because I have no root priviledge. Now I have successfully installed ant and JCC locally on CentOS under my own home directory. The directory of JCC is…
2hc
  • 37
  • 1
  • 5
2
votes
4 answers

ImportError: no module named _jcc

I'm trying to install pylucene on python 2.7 (windows) since four days. It requires JCC to build and install. After thousands and thousands different errors I handled myself, at last JCC sucessfully builded and installed. At least, that was what I…
Teoman Unay
  • 21
  • 1
  • 2
2
votes
0 answers

How can I create my own TokenFilter in PyLucene inherited from PythonTokenFilter

everyone: I'm developing my own Analyzer in PyLucene 4.9.0 and created a TokenFilter for CompoundTokenFilter in the analyzer as the DictionaryCompoundTokenFilter not performing very well. DictionaryCompoundTokenFilter uses a brute algorithm, but I'd…
wangdan
  • 21
  • 3
2
votes
2 answers

PyLucene make fails : [ivy:retrieve] error

I am following the instructions to install PyLucene, as given in : https://lucene.apache.org/pylucene/install.html The problem is when I make, ivy can't retrieve something and it fails. [ivy:retrieve] You probably access the destination server…
Avisek
  • 363
  • 1
  • 3
  • 16
2
votes
2 answers

Lucene's MultiFieldQueryParser ignores setDefaultOperator

I'm trying to change the default operator for a MultiFieldQueryParser: fields = [...] query = "hello stackoverflow" clauses = [BooleanClause.Occur.SHOULD, BooleanClause.Occur.SHOULD, ...] parser = MultiFieldQueryParser(Version.LUCENE_CURRENT,…
Peter Clause
  • 1,132
  • 9
  • 22
1
2
3
8 9