Questions tagged [whoosh]

Whoosh is a fast, featureful, full-text indexing and searching library implemented in pure Python.

Fast, pure-Python, full text indexing, search and spell checking library. Whoosh on the Python Package Index

Whoosh Documentation

373 questions
2
votes
1 answer

out of index in python

I'm trying to make indexing to an excel file and i used whoosh package but, i found an error that the list index is out of range. please, can anyone help me? my code is: from whoosh import fields, index import os.path import csv import codecs #…
2
votes
1 answer

Django haystack, how to match parts of words?

I'm using haystack 1.2.7 + whoosh 2.4.0 in Django 1.4 (Python is 2.7) Example: Search query "sear" should match items containing "search" and "sear" and "searching" (etc). my settings: HAYSTACK_SITECONF =…
1
vote
2 answers

More than one index field in self.prepared_data

I'm using Haystack and Whoosh with Django and I was trying to create the indexes through the code below: class LivroIndex(SearchIndex): text = CharField(document=True, use_template=True) autor = CharField(model_attr='Autor') titulo = …
user1106551
1
vote
1 answer

Whoosh, Haystack, and AttributeError: 'Segment' object has no attribute 'compound' in production only

I'm using Whoosh 2.3.2, Haystack 2.0.0, and Django 1.3 under Python 2.7. On Localhost, it works great. In production, it explodes when I search. Here's the full traceback: File…
Jason Goldstein
  • 1,117
  • 2
  • 11
  • 20
1
vote
1 answer

Whoosh Query Composition for docs containing 'most' terms

I've just started using Whoosh and noticed that queries seem to have logic applied to each term such as AND([term1, term2, ...]) or OR([term1, term2, ...]). My problem is that I want to include documents that include most of the terms in my search…
Trindaz
  • 17,029
  • 21
  • 82
  • 111
1
vote
1 answer

Python: Whoosh search for a non-exact query

Is it possible to use Whoosh to search for documents that do not exactly match the query, but are very close to it? For example, only one word is missing in the query to find something. I wrote a simple code that works if the query covers all…
yoloven
  • 11
  • 4
1
vote
1 answer

highlight search query full result in whoosh

I have a query t1 = query.Term("content", "field") t2 = query.Term("content", "information") t3 = query.Term("content", "document") q = spans.SpanNear2([t1, t2, t3], slop=5, ordered=True) finds and marks individual words. [information]…
arman
  • 141
  • 1
  • 11
1
vote
1 answer

nginx + django + haystack = Server Error (500)

I've cobbled together a small blog application in Django using Haystack with Whoosh backend for search. It runs nicely in development server (on the laptop) but search fails when site runs in nginx on a server (rpi). I can access search page but any…
erg
  • 25
  • 1
  • 5
1
vote
1 answer

Document search in Lucene/Solr, Whoosh, Sphinx, Xapian

I am comparing Lucene/Solr, Whoosh, Sphinx and Xapian for searching documents in DOC, DOCX, HTML and PDF. Only Solr is documented to have a document parser (Tika) which directly indexes documents. So it seems a clear winner. But to level the playing…
Jesvin Jose
  • 22,498
  • 32
  • 109
  • 202
1
vote
1 answer

Text indexers (for python) with inbuilt support for doc, docx and pdf files

I am currently on the lookout for a text indexer for my python program. I shortlisted Solr, a Lucene project and Whoosh, which is native to python. I searched a lot of documentation on support for doc, docx and pdf files, and Solr kept pointing me…
Jesvin Jose
  • 22,498
  • 32
  • 109
  • 202
1
vote
1 answer

The whoosh can not search the key word successfully

1.I'm writing a very simple whoosh project. Firstly, I read a txt file and use read() method to get all the contents in the txt file. Then build an index for this content. 2.Here is the code for implementation: for the txt file content: #import…
Jay Park
  • 308
  • 1
  • 6
  • 14
1
vote
1 answer

Add whoosh search engine to local html files

I have multiple HTML files that are all stored locally. I wrote a Python script with Whoosh to create an index to search the content of these files. I also have a Python script with Whoosh code to search this index based on a query. Now I want to…
Yasmina
  • 51
  • 3
1
vote
1 answer

How to return the corresponding line that matches our search keyword in whoosh?

Lets say given file a.txt: hello world good morning world good night world Given the keyword that I want to search is morning, I want to use whoosh python library to return the line that matches the keyword morning in the text file a.txt. So, it…
Victor Vic
  • 23
  • 3
1
vote
0 answers

Fuzzy prefix query with Whoosh

I'm trying to implement a prefix fuzzy query with Whoosh - autocomplete-style search that allows typos. For example, I would like the following queries to match the document "hello world": "hello wo" (prefix search) "hellp" (fuzzy search, distance…
1
vote
1 answer

django-cms-search not displaying any results

Hey I have been plugging away trying to get haystack working with whoosh to implement a search backend for a django cms project i have been working on. After figuring out some really weird permissions errors i can almost taste sucess. I now have…
Mike Waites
  • 1,688
  • 3
  • 19
  • 26