Questions tagged [pylucene]

PyLucene is a Python extension for accessing Java Lucene. Its goal is to allow you to use Lucene's text indexing and searching capabilities from Python.

PyLucene is a Python extension for accessing Java Lucene. Its goal is to allow you to use Lucene's text indexing and searching capabilities from Python.

PyLucene is not a Lucene port but a Python wrapper around Java Lucene. PyLucene embeds a Java VM with Lucene into a Python process.

128 questions
1
vote
0 answers

Indexing wikipedia dumps without losing information

I'm working on a search engine using wikipedia dumps. I've split, parsed and extracted clean text from the articles and the next step is to build an index. I chose to use pylucene for that task, but the question is, should I index the whole article…
Asma
  • 129
  • 1
  • 2
  • 9
1
vote
0 answers

Pylucene installation on windows

I changed the Java JDK version to version 8, redefined the environment variables, reinstalled the JCC module successfully and changed the Makefile to have: JAVA_HOME = C:/PROGRA~1/Java/jdk1.8.0_231 And now when I hit 'make' this error…
1
vote
0 answers

JVM in PyLucene

I am planning to use PyLucene as a tokenizer in my project. It is mentioned in PyLucene homepage that it allows python to make calls to Java classes directly via JNI interface. Also, before making calls to Java classes, we first need to call…
Raj
  • 11
  • 1
1
vote
0 answers

Accessing documents marked for deletion

Is there any way to access documents marked for deletion that are still present in segments? If not via Elasticsearch's REST API, then perhaps using Lucene to iterate through documents some way (kind of like this answer maybe)?
1
vote
0 answers

pylucene installation make test fails

I'm following this blog to install pylucene on Ubuntu. I installed all the required files. At the last step which is executing the following scripts: make make test make install I get an error when I run the second line make test. # # A fatal error…
Minions
  • 5,104
  • 5
  • 50
  • 91
1
vote
1 answer

Using SnowBallAnalyzer with PyLucene

I'm trying to use SnowBallAnalyzer in PyLucene but I always get an error saying: InvalidArgsError when I try to create an instance of it like this: analyzer = SnowBallAnalyzer("Spanish") or analyzer = SnowBallAnalyzer("Spanish", STOPWORDS) What I…
Harph
  • 2,250
  • 2
  • 17
  • 16
1
vote
0 answers

Error importing JCC. Cannot find libjava.so

I'm having an issue trying to install PyLucene. I'm running the Makefile provided, but am getting an error at line 385: Traceback (most recent call last): File "/home/89772/anaconda3/lib/python3.7/runpy.py", line 183, in _run_module_as_main …
John Brown
  • 53
  • 5
1
vote
0 answers

Performance comparison between Lucene Vs PyLucene Vs Whoosh

Is there any benchmark comparing Lucene Vs PyLucene Vs Whoosh? Lucene seems to be far ahead in terms of popularity, but I'm looking for something more Pythonic. So just want to get rough idea on the tradeoff.
Dian Bakti
  • 310
  • 5
  • 15
1
vote
2 answers

Trouble with makefile when installing PyLucene

I am trying to build PyLucene on my Windows 10 machine. Here are the details I am currently providing on the Makefile: PREFIX_PYTHON=C:\\Users\\Charlie\\Anaconda3 ANT=C:\\Program Files\\apache-ant-1.10.5\\bin\\ant JAVA_HOME=C:\\Program…
guruman
  • 25
  • 6
1
vote
0 answers

Ubuntu install pylucene-6.5.0 , ivy error

Here is the error message! ant Buildfile: /home/dingyi/pylucene-7.5.0/lucene-java-7.5.0/lucene/build.xml ivy-availability-check: [loadresource] Do not set property disallowed.ivy.jars.list as its length is…
yi ding
  • 11
  • 1
1
vote
1 answer

what is the use of StringField.TYPE_NOT_STORED

I am trying to understand different Field options provided by Lucene, As per my knowledge Lucene will provide the following field options TextField.TYPE_STORED - Analyzed & stored TextField.TYPE_NOT_STORED - Analyzed & Not…
N.Dinesh.Reddy
  • 522
  • 2
  • 7
  • 15
1
vote
0 answers

Segmentation fault (core dumped) on pylucene initVM()

I use pylucene to build a lucene search with python. I think after some system updates on my Ubuntu 16.04.2 LTS workstation today I get an error "Segmentation fault (core dumped)" on every call of lucene.initVM(). Lucene is the default ubuntu…
klein-ich
  • 11
  • 3
1
vote
2 answers

pyLucene Installation

I am on Ubuntu 10.04, Python 2.6.5 & having some trouble installing pyLucene. Here's what I have done so far, installed these packages - sudo apt-get install ant sudo apt-get install sun-java6-jdk sudo update-java-alternatives -s java-1.5.0-sun sudo…
Inception
1
vote
0 answers

pylucene unresolved reference when importing

I have already installed JCC 2.23 and pylucene 6.4.1. However, when I open the sample of pylucene, there are always some errors just as this picture. My Java version is 1.8 and I am using Mac OS X.
NigelZhang
  • 11
  • 4
1
vote
1 answer

Lucene: search within search using FuzzyQuery

I need to make a FuzzyQuery using an index that contains around 8 million lines. That kind of query is pretty slow, needing about 20 seconds for every match. The fact is that I can narrow down the results using another field to about 5000 hits…
Doppelganger
  • 20,114
  • 8
  • 31
  • 29
1 2 3
8 9