Questions tagged [pylucene]

PyLucene is a Python extension for accessing Java Lucene. Its goal is to allow you to use Lucene's text indexing and searching capabilities from Python.

PyLucene is not a Lucene port but a Python wrapper around Java Lucene. PyLucene embeds a Java VM with Lucene into a Python process.

128 questions
2
votes
1 answer

PyLucene : ImportError: cannot import name SimpleFSDirectory

So I want to play a little with PyLucene because someone gave me a cfs file and PyLucene looked like the best way to deal with that in Python. I only followed the tutorial from http://bendemott.blogspot.co.uk/2013/11/installing-pylucene-4-451.html I…
PonteIneptique
  • 175
  • 1
  • 1
  • 12
2
votes
2 answers

ImportError: cannot import name Library, while installing PyLucene

I have spent 5 hours trying to fix this problem without success. While installing PyLucene following http://lucene.apache.org/pylucene/install.htmt , I ran into an error like the following. sanghee-m:jcc sanghee$ python setup.py build found JAVAFRAMEWORKS =…
sangheestyle
  • 1,037
  • 2
  • 16
  • 28
2
votes
1 answer

from _jcc import initVM DLL load failed: The specified module could not be found

Hi, I am trying to port my working Django v1.3 application, which uses PyLucene, from the Django dev server to Apache with mod_wsgi, where it gives an error on the same Windows XP machine. Error message from Django debug: ViewDoesNotExist at…
Carlos Ferreira
  • 1,980
  • 2
  • 14
  • 18
2
votes
0 answers

Using ShingleFilter to build a customized analyzer in PyLucene

I am pretty new to Lucene and PyLucene. My problem arises when I use PyLucene to write a customized analyzer that tokenizes text into bigrams. The code for the analyzer class is: class BiGramShingleAnalyzer(PythonAnalyzer): …
DiamRem
  • 604
  • 1
  • 4
  • 9
2
votes
0 answers

Log-Likelihood Ratio statistic by PyLucene

Currently I am analyzing a pretty huge amount of text. I would like to perform a Log-Likelihood Ratio on two wordlists in order to identify frequency deviations of terms in the foreground corpus vs the normative corpus. I have coded the…
1
vote
1 answer

Get field's tokens from lucene index

How can I get the tokens (whether it be the list of tokens, TokenStream, or something else) that were used for a Field within a Document from a lucene index? That is, is it possible to get the tokens that were used in tokens (from the example) from…
Uyghur Lives Matter
  • 18,820
  • 42
  • 108
  • 144
1
vote
2 answers

Search Index - 12 = Twelve

I'm curious what the best method is to deal with tokenizing/indexing terms (in Lucene, or any search engine for that matter) so that these searches would match corresponding terms: "12" = "twelve", "mx1" = "mx one". Is there any built-in functionality…
Ben DeMott
  • 3,362
  • 1
  • 25
  • 35
1
vote
2 answers

pylucene: install error

UPDATE question: Thanks Favoretti! I installed jcc using apt-get, and the makefile under pylucene ran for a little bit and then threw a Java error that I don't have a clue about. Here it is. Can you please let me know what error it is? While loading…
Zenvega
  • 1,974
  • 9
  • 28
  • 45
1
vote
1 answer

PyLucene Custom TokenStream using PythonTokenStream

I am attempting to build a TokenStream from a Python sequence. Just for fun, I want to be able to pass my own tokens directly to pylucene.Field("MyField", MyTokenStream). I tried to make "MyTokenStream" by... terms = ['pant', 'on', 'ground', 'look',…
Ben DeMott
  • 3,362
  • 1
  • 25
  • 35
1
vote
0 answers

Failed importing ICUFoldingFilter while using PyLucene

When issuing the import statement "from lucene.ICUFoldingFilter import ICUFoldingFilter", I get an error: ValueError: (2, 'No such file or directory',…
user2773013
  • 3,102
  • 8
  • 38
  • 58
1
vote
1 answer

PyLucene fuzzy search does not return anything even with the same search term

I'm trying to build a fuzzy search query in PyLucene, but even when I use the same term that was already uploaded, it doesn't return anything. I've tried uploading the fields as StringField or TextField and even using a customized one and change the…
1
vote
0 answers

Lucene Search based on edit-distance on entire text rather than individual tokens

I am using SpanNearQuery with SpanMultiTermQueryWrapper to match my query text, which contains more than one word, with an edit distance of either 1 or 2 against documents that each contain multiple tokens. Here I need to specify the edit distance for each…
1
vote
0 answers

PyLucene install: "make" not working and "jvm.dll could not be found"

I'm trying to get PyLucene going on my computer and have followed all of the steps but am getting stuck at the "make" step. I'm working on a 64-bit Windows 10 machine and my version of Python is 3.9.12. What I've done: 1.) Install Apache Ant and set…
Michael S.
  • 3,050
  • 4
  • 19
  • 34
1
vote
1 answer

Problem pinging or connecting via SSH to a Docker container

I am new to Docker. I installed Docker on Windows 10 and have also installed SSH on my Windows machine. I have a PyLucene Docker container. When I check the status of my Docker container, it is Up. PS C:\Windows\system32> docker ps -a CONTAINER ID IMAGE …
Coder Guy
  • 31
  • 3
1
vote
1 answer

How to get a list of all tokens from Lucene 8.6.1 index using PyLucene?

I have got some direction from this question. I first make the index like below. import lucene from org.apache.lucene.analysis.standard import StandardAnalyzer from org.apache.lucene.index import IndexWriterConfig, IndexWriter, DirectoryReader from…
PSK
  • 347
  • 2
  • 13