Questions tagged [relevance]

Search Relevance is the practice of manipulating a search application's behavior to improve the quality of the search results based on the needs of the application's users

Search Relevance is the practice of improving the quality of search results in a search application. In relevance work, you line up a user audiences needs (as expressed in their natural language search string) with the content available in the search application.

For example, a customer typing the search "cancer" into a hospital site's search is likely to have very different expectations than doctors typing that term into a medical research site. In the hospital case, users want a "info desk" search user experience, and are likely interested in the hospital's cancer services. For doctors searching a research site, they're more likely to be interested in the latest cancer research.

With search relevance, you work to understand the user's needs and line up the search application's ranking behavior to match. Relevance work involves technical work to manipulate the ranking behavior of a commercial or open source search engine like Solr, Elasticsearch, Endeca, Algolia, etc. This means manipulating field weightings, query formulations, text analysis, and more complex search engine capabilities. It may also mean leveraging user behavioral data, NLP, statistical, and other machine learning techniques to modify or enrich the behavior of such a search engine.

More Reading

Blog

Books

273 questions
1
vote
1 answer

Mixing results from different Lucene sources

We have multiple indexes using lucene and we do a parallel fetch of results and show them in different search result pages. We want to unify them as a single mixed set results sorted by relevance. Is there any standard method to mix results of…
Alagu
  • 2,864
  • 3
  • 26
  • 40
1
vote
1 answer

Is there an algorithm for determining the relevance of a text to a theme?

I want to know what can be used to determine the relevance of a page for a theme like games, movies, etc. Is there some research in this area or is there only counting how many times some relevant words appear?
Renato Dinhani
  • 35,057
  • 55
  • 139
  • 199
1
vote
1 answer

An issue in sorting the items by relevance in Lucene 2.9

We are using Lucene.NET Version 2.9 to search for items using free text query. We get the items automatically sorted by Lucene by relevance. It has been working great except for one odd case. When search for Agile project management, the top 4 items…
dotcoder
  • 2,828
  • 10
  • 34
  • 50
1
vote
1 answer

How do I sort search results by relevance?

I'm working on a project which searches through a database, then sorts the search results by relevance, according to a string the user inputs. I think my current search is fairly decent, but the comparator I wrote to sort the results by relevance is…
Salmontres
  • 33
  • 5
1
vote
0 answers

Does Elasticsearch assign relevance to text fields based on their format? For example, heading, link, list item, bold, underline?

Intuitively when searching a document the match (similarity score) should take into account formatting "hints", such as heading (higher = more important), link, list item, bold, underline etc., vs. simple text. Does Elasticsearch do that? If so…
Shaul Dar
  • 881
  • 3
  • 10
  • 16
1
vote
1 answer

MarkLogic relevance scoring calculations using logtfidf method

I'm trying to find how MarkLogic calculates relevancy score. MarkLogic support pointed me to a knowledge base article (link in reference) where I saw the below formula (natural log). log(1/term frequency) * log(1/document frequency) When I apply…
P K
  • 162
  • 12
1
vote
1 answer

Any ideas of what more web page meta information I can use to classify a page relevance for some theme?

I'm doing an algorithm to classify the relevance of a page for some theme like 'movies' using all meta information as possible, but excluding the textual content of the body. I want to know what can I use to determine if a page has some info about…
Renato Dinhani
  • 35,057
  • 55
  • 139
  • 199
1
vote
0 answers

Asking about sorting or ordering results from a search engine by relevancy in php/Mysql

I have a search engine based on multiple select options that the user can enter a lot of option and the search engine fetch the database to get the the results of arrays that entered in the the select options but I need to make this code to fetch…
1
vote
2 answers

Pooling Method in TREC competitions

This is a very fundamental and silly doubt. I have read that in order to prevent large relevance assessments in TREC competitions (reference), the top-ranked documents returned by participating systems are pooled to create the set of documents for…
1
vote
1 answer

Why does Solr changes record position after updating a field

I am new to Solr and encountered a weird behavior as I update a field and perform search. Here's the scenario : I have a 300records in my core, I have a search query wherein I filtered the results with this fq=IsSoldHidden:false AND IsDeleted:false…
venalyn sudaria
  • 149
  • 1
  • 11
1
vote
3 answers

How to sort keyword search results by hit frequency?

I've built a search index (two column table) which assigns single keywords to their content ID. Now searching for single keywords works and is damn fast. However, searching for multiple keywords like SELECT media_id, keyword FROM search_index WHERE…
Cobra_Fast
  • 15,671
  • 8
  • 57
  • 102
1
vote
1 answer

Search for two fields but only score once in Elasticsearch

Let's say I have these documents in Elasticsearch: { "display_name": "Jose Cummings", "username": "josecummings" }, { "display_name": "Jose Ramirez", "username": "elite_gamer" }, { "display_name": "Lance Abrams", "username":…
Josh
  • 8,079
  • 3
  • 24
  • 49
1
vote
1 answer

Removing boost term from scoring in elasticsearch

Is there some way, that I can remove default boost term from elastic relevance scoring or make it 1(so it does not reflect in scoring). for example, edit: input query is GET test_index/_search { "query": { "match": { "text": "asia…
1
vote
1 answer

ngram matching gives same score to less relevant documents

I am searching for Bob Smith in my elasticsearch index. The results Bob Smith and Bobbi Smith both come back in the response with the same score. I want Bob Smith to have a higher score so that it appears first in my result set. Why are the scores…
GNG
  • 1,341
  • 2
  • 23
  • 50
1
vote
1 answer

Algorithm for relevance/matching tree of value

Is there a name of the pattern/algorithm for what I'm trying to describe below?... Say you have a tree of relevance-data like this: IDEs Visual Studio Visual Studio 2008 Visual Studio 2010 Eclipse Then I have an object that contains a…
Seb Nilsson
  • 26,200
  • 30
  • 103
  • 130