Questions tagged [relevance]

Search Relevance is the practice of manipulating a search application's behavior to improve the quality of the search results based on the needs of the application's users

Search Relevance is the practice of improving the quality of search results in a search application. In relevance work, you line up a user audiences needs (as expressed in their natural language search string) with the content available in the search application.

For example, a customer typing the search "cancer" into a hospital site's search is likely to have very different expectations than doctors typing that term into a medical research site. In the hospital case, users want a "info desk" search user experience, and are likely interested in the hospital's cancer services. For doctors searching a research site, they're more likely to be interested in the latest cancer research.

With search relevance, you work to understand the user's needs and line up the search application's ranking behavior to match. Relevance work involves technical work to manipulate the ranking behavior of a commercial or open source search engine like Solr, Elasticsearch, Endeca, Algolia, etc. This means manipulating field weightings, query formulations, text analysis, and more complex search engine capabilities. It may also mean leveraging user behavioral data, NLP, statistical, and other machine learning techniques to modify or enrich the behavior of such a search engine.

More Reading

Blog

Books

273 questions
0
votes
2 answers

Limit number of results by Max Score's 80% and above(Please Check Body) in SOLR

Lets say my MaxScore is 500. Then I want results with 500's 80% and above Score results only . How can I achieve that?
0
votes
1 answer

How to make shorter (closer) token match more relevant? (edge_ngram)

I'm getting weird results with edge_ngram tokenizer I'm using for autocomplete. I'm trying to figure out how to make my results more relevant. I copied the example from the elasticsearch documentation. I have documents with the following…
orszaczky
  • 13,301
  • 8
  • 47
  • 54
0
votes
0 answers

How to boost a field in solr as described in detail below for relevance sorting by score

I have few fields indexed as title, bookname, description, author, topic and others and also there is one field "TiBoDe" which contains all values from title, bookname, description. Now if I do like TiBoDe:game it will not give relevant results So I…
0
votes
1 answer

relevance models

The relevance model just estimates the relevance feedback based on feedback documents. In this case, the relevance model would have a higher probability of getting common words as its feedbacks. Thus I assumed the performance of the relevance model…
user19283043
  • 327
  • 6
  • 17
0
votes
2 answers

Boost solr results with one word match from multiValued

I have solr schema with certain fields:
striker
  • 1,253
  • 3
  • 15
  • 25
0
votes
1 answer

Architecture design to find the most relevant symptoms given an input symptom

My goal is to build a symptom recommendation system I have 3 columns of data in my excel. Patient id Symptoms Disease detection For each patient id there is one or more than one symptom that leads to a disease detection. My goal is to find the…
Lalit
  • 79
  • 6
0
votes
1 answer

Assign a higher score to matches containing the search query at an earlier position in elasticsearch

This question is similar to my other question enter link description here which Val answered. I have an index containing 3 documents. { "firstname": "Anne", "lastname": "Borg", } { "firstname":…
GNG
  • 1,341
  • 2
  • 23
  • 50
0
votes
2 answers

How can I get elasticsearch to assign a higher score to strings of tokens that are in the matching order?

I'm building a search database. Most entries are proper nouns (names and street addresses). I set up an ngram token filter to help with fast fuzzy searching. It works well. However, if I search for "John Allen", the results include "John Allen" and…
GNG
  • 1,341
  • 2
  • 23
  • 50
0
votes
1 answer

What vector space is Rocchio algorithm computed in?

I have been trying to implement the Rocchio algorithm and I understand the basic idea behind the algorithm but I struggle to put it into concrete terms. I calculated tf_idf before and that is a vector of length of the number of query terms we search…
vcucu
  • 184
  • 3
  • 12
0
votes
1 answer

How does calculating relevance scoring in Elasticsearch differ from Couchbase?

I wonder whether relevance score in elasticsearch has differences with couchbase or not?
nhtrung
  • 55
  • 7
0
votes
1 answer

Relevance percentage( searchstring is more "left" in sourcestring)

Can anybody help with selecting of algorithm. To compare two strings and give a relevance percentage(if more left found, more rank). maybe merge two algorithms for example: looking for "chocolate white quills" we have records CHOCOLATE, WHITE/DARK…
Katya
  • 13
  • 4
0
votes
0 answers

Is there a way of boosting sub-sections of long documents in Lucene/SOLR?

I run a search engine that specialises in clinical documents. Most are abstracts, perhaps 250-350 words. One issue, which is a real pain, is searching over guidelines.  They are typically long (1000+ words) and have short titles. So, the title…
JRBTrip
  • 11
  • 1
0
votes
2 answers

MySQL query ordering off by 1?

The query: select title from gamelist where match (title) against ('super mario luigi'); Returns: +-----------------------------------+ | title | +-----------------------------------+ | Super Mario and Luigi …
Josh
  • 657
  • 3
  • 15
  • 30
0
votes
0 answers

Elasticsearch fuzzy query hit is more relevant than direct hit

I am having problems with fuzzy queries giving higher relavance to the fuzzy hit term than the direct match. I dont understand why - even after having rtfm. Creating a new index curl -XPUT http://192.168.178.40:9200/test_test_test Inserting…
Michael Leiss
  • 5,395
  • 3
  • 21
  • 27
0
votes
0 answers

Mysql Full Text Match Returns Bool Match but 0 Relevance

I'm having some behavior I've not experienced before with Mysql's full text indexing. The relevance for a search with cell comes back as 0 for all records. The in boolean result though verifies the term is present (and the where clause also). select…
user3783243
  • 5,368
  • 5
  • 22
  • 41