We are confronting different search engines for our research archives and having browsed the Xapian-Omega documentation, we decided to try it out since the Omega option appears to be an appropriate solution with several interesting search options.
We installed Xapian-Omega on a Linux Server (Deb 7) and tested the setup with success. However we are unsure as to how one can employ or perhaps even enable the use of Wild Cards or Regular Expressions with Xapian-Omega.
We read that for Xapian one has to enable the Wild Card option "QueryParser flags" Could someone clarify this ? ie. explain with or indicate a page with an example or two.
But we did not see much information regarding examples with Omega CGI and although this latter runs well, wild card options (such as * for the general wild card and ? as a single character), do not seem to work as expected by default and they would be useful, even though stemming and substrings etc may be functional.
Eg: It would be interesting to be able to employ standard simple wild char searches with a certain precision such as : medic* for medicine medical medicament or with ? for single characters
Can Regexp be recognised with Omega ? eg : sep[ae]r[ae]te(\w+)? or searching for structured formats such as Email or Credit Card Numbers or certain formula types in research papers etc.
In a note from Olly Betts long ago (Dev Mailing List) regarding this one suggestion was to grep the index file but this would defeat the RAD advantage of Omega.
Any examples of searches using Omega with Wild Cards or Regular Expressions would be most appreciated ... even an indication of a page where information regarding this theme is well presented with examples illustrating how to develop advanced searches using Xapian alone would be most welcome (PHP or Python perhaps).
(We are not concerned for the moment about the eventual substantial increase in the size of the index size or in the time to index the archive)