Questions tagged [whoosh]

Whoosh is a fast, featureful, full-text indexing and searching library implemented in pure Python.

Fast, pure-Python, full text indexing, search and spell checking library. Whoosh on the Python Package Index

Whoosh Documentation

373 questions
1
vote
2 answers

Django+Haystack: Exception Value: 'module' object has no attribute 'get_model'

I set up a very simple project to try out Haystack using a Whoosh engine, mostly following the example in the documentation. I installed everything using pip and no version numbers, so I should have the latest release versions. I'm getting this…
Sushil
  • 371
  • 1
  • 3
  • 11
1
vote
0 answers

Not able to search non-english(hindi) words in whoosh?

I have json dumps from api which is in Hindi. First I got these json files: def retrieve_data(): '''Get articles from api and save locally.''' i = 1 while True: articles = get_articles(page_no=i) if not articles or…
Rahul Shrivastava
  • 1,391
  • 3
  • 14
  • 38
1
vote
1 answer

documents not being indexed after whoosh StemmingAnalyzer cache change

per whoosh documentation here , giving the StemmingAnalyzer an unbounded cache makes batch indexing faster: writer = myindex.writer() # Get the analyzer object from a text field stem_ana = writer.schema["content"].format.analyzer # Set the cachesize…
Hakim
  • 1,242
  • 1
  • 10
  • 22
1
vote
2 answers

How do I get the bag-of-words representation of document content with Whoosh?

I have an index schema like the following: schema = Schema( title=TEXT(stored=True), content=TEXT, id=ID, topicID=NUMERIC, ) I first get documents for topic t using searcher.documents(topicID=t). This returns hits. I'd like to get…
Matt
  • 303
  • 1
  • 2
  • 16
1
vote
2 answers

Efficiently get n random documents from a Whoosh index

Given a large Whoosh index, how can I efficiently retrieve n random documents from it? I can do this horribly inefficiently just by pulling all the documents into memory and using…
Mark Amery
  • 143,130
  • 81
  • 406
  • 459
1
vote
1 answer

Django 1.9/Haystack 2.4.1 "Model could not be found for SearchResult"

Let me just first say, I have tried the fixes here: Haystack says “Model could not be found for SearchResult” and I'm still getting Model could not be found for SearchResult ''. I'm on Django 1.9 &…
1
vote
1 answer

Seach index not working when pushing new version of app

I am using django-haystack and whoosh search engine in my django app. Everything is working alright, except when I git push new version to my OpenShift server, search stops working. It simply does not return any results. If I run ./manage.py…
Lucas03
  • 2,267
  • 2
  • 32
  • 60
1
vote
1 answer

How to get highlighted searches on whoosh

I used an example code from pythonhosted.org but nothing seems to happen. This is code I used: results = mysearcher.search(myquery) for hit in results: print(hit["title"]) I entered this code into python but it gives an error saying mysearcher…
JayJay Simpson
  • 155
  • 3
  • 15
1
vote
1 answer

Django haystack indexing many to many field but search failing if there are more than one word

I have followed this link-->Django Haystack and Taggit and had indexed the fields using haystack and whoosh as the search engine. Search failed if the many to many field had more than one word as the attribute value. The indexing is done as --> >>>…
Subhajit
  • 361
  • 1
  • 4
  • 18
1
vote
2 answers

Using WhooshAlchemy with joined table inheritance

I currently have something along the lines of this (taken from the SQLAlchemy documentation): class Employee(Base): __tablename__ = 'employee' id = Column(Integer, primary_key=True) name = Column(String(50)) type =…
Encrylize
  • 95
  • 6
1
vote
1 answer

Writer.add_document() function error Whoosh - mysql loop

I'm trying to index a large amount of articles from a db which is encoded in latin1. I've solved the encoding issue with charset, but I not able to add the to the index each row. I've tried : 1) writer.add_document(Id = unicode(row["Id"]),Body =…
Claudia Guirao
  • 335
  • 1
  • 3
  • 10
1
vote
0 answers

FeinCMS search rich text content data

I am working on an existing Django project that uses FeinCMS for the content management. I am trying to implement a 'Site Search'. Using Haystack and Whoosh I can happily search the standard Django models. However, when I try to use the same…
Ric
  • 57
  • 1
  • 7
1
vote
0 answers

Django Haystack- two input fields

I've a haystack search model and whoosh search engine working well on below model: class Business(models.Model): name = models.CharField(max_length=64) address = models.CharField(max_length=64, blank=True, null=True) district =…
MOHOSHA
  • 99
  • 8
1
vote
1 answer

google-app-engine full-text-search ,which is better , "google custom search" or whoosh

this is whoosh so ,did you know which is better ? thanks
zjm1126
  • 63,397
  • 81
  • 173
  • 221
1
vote
2 answers

How to create an index using Whoosh

I am trying to use Whoosh for text searching for the first time. I want to search for documents containing the word "XML". But because I am new to Whoosh, I just wrote a program that search for a word from a document. Where the document is a…
user3422243
  • 87
  • 2
  • 6