Questions tagged [whoosh]

Whoosh is a fast, featureful, full-text indexing and searching library implemented in pure Python.

Fast, pure-Python full-text indexing, search, and spell-checking library.

Whoosh on the Python Package Index

Whoosh Documentation

373 questions
1
vote
1 answer

How to include an apostrophe (') in a Whoosh query?

How can I include an apostrophe (') in a Whoosh query? For example, to search for that's: tws_fileName.tws_query_index( 'that's' ). This query does not work, and neither does the following: tws_fileName.tws_query_index( "that's" )
A Alw
1
vote
1 answer

Integer too large error with vectoring during whoosh indexing

I am trying to use Whoosh to index a large corpus (roughly 25 million academic abstracts plus titles). I marked the "abstract" field with vector=True because I need to be able to compute high-scoring key terms based on the abstracts for similarity IR.…
1
vote
1 answer

django haystack whoosh not showing any errors also no results

I am trying to use django-haystack with Whoosh. Django Haystack & Whoosh search is working, but the page gives no results. I have looked at these similar questions too: Django-haystack-whoosh is giving no results; Django Haystack & Whoosh Search Working, But…
Nil Sagor
1
vote
1 answer

Match a query within token in Whoosh

I want to apply a search with Whoosh on a text. Right now this works only for exact matches of tokens (space delimited). I'd like to also match within a token (e.g., match add in the token "added"). I know about stemming and variations, but these are…
1
vote
1 answer

NLP Phrase Search in Python

I have been going through many libraries like Whoosh and NLTK and concepts like WordNet. However, I am unable to tackle my problem. I am not sure whether I can find a library for this or whether I have to build it using the above-mentioned resources. Question: My…
1
vote
0 answers

Django-Haystack-Whoosh is giving no results

I'm trying to use Haystack-Whoosh for search in a Django application. I've implemented the same code as given in the documentation page (django-haystack documentation), but it is still not working; no search results were filtered. Here is my…
Devya
1
vote
1 answer

Looking solutions for performing full text search with low memory cost

I wrote a web application using Django to display product information. My database (MySQL) contains ~200,000 records (fields: brand, name, and img). I want to add a search function to my app (the search is performed on the brand and name fields). I…
cityCoder
1
vote
1 answer

Creating custom analyzers using whoosh

I am trying to implement a semantic search engine with a deep NLP pipeline using Whoosh. Currently, I just have a stemming analyzer, but I need to add lemmatizing and POS tagging to my analyzers. schema = Schema(id=ID(stored=True, unique=True),…
Shruti h
1
vote
0 answers

Search sqlalchemy database with whooshalchemy

Alright, so I'm trying to allow a user to search for a movie or TV show in the database I've set up using SQLAlchemy. I followed along with a tutorial and was able to print all the movies from my database to the screen using this code:
humbleCoder
1
vote
1 answer

NameError: name 'base' is not defined(haystack search)

I wanted to include full-text search in my Django application. I am using whoosh-haystack for this. When I include whoosh and haystack in my installed apps and execute the command ./manage.py, I get an import error. Can anyone sort this…
1
vote
1 answer

IndexError: list index out of range (In whoosh Search Engine library) error at

I am trying to create a Whoosh index of a 150 MB file, but it shows the error "list index out of range". I have quoted the line that is responsible for the error: for x in range(len(id)):. Logically, the index record will be equivalent to the ID number…
Abhishek Kaushik
1
vote
1 answer

How to read number of terms in the index created by whoosh library in python?

The Whoosh library automatically creates three files after the commit operation, and I am not able to find any way to read the seg file.
1
vote
0 answers

Multiprocessing Pool() equivalent for non-picklable objects

Short: Is there an equivalent to multiprocessing.Pool()'s map function for non-picklable objects in Python 3? Or do I misunderstand this error: TypeError: can't pickle memoryview objects? Long: I have a large Whoosh index (text indexing and…
ted
1
vote
1 answer

Index related table using Haystack/Whoosh

How can I index a related table: class Foo(models.Model): name = models.CharField(max_length=50) class FooImg(models.Model): image = models.ImageField(upload_to='img/', default='img/no-img.jpg', …
almost a beginner
1
vote
1 answer

How to do spell check using whoosh python library?

How can I perform spell checking using the Whoosh library? I have added some code from the documentation, but it is not correcting words. Please find my code below. def main(): print " Hi" schema = Schema(title=TEXT(stored=True),…