I am creating a schema for Vespa mainly for English, but with two fields in Wylie transliteration of Tibetan, which looks like this
'jam dpal smra ba'i seng ge la bstod pa ut+pal dmar po'i do shal
Typically users want to match every token and preserve the word order, and preferably in the beginning of the field.
For example, to find the field above, user might enter "'jam dpal smra ba'i seng ge". They would not appreciate results where these tokens would appear in different order, even if that would rank high with BM25. BM25 would still be needed for fallback.
Could you give me an example of the schema field / ranking expression to rank in this order:
- exact match in the beginning of field
- exact match anywhere
- bm25
Naturally, I'll turn off stemming. Also, apostrophes and, less importantly, plus signs should be preserved.
I have read especially the Schema Reference of Vespa docs, but I did not find a solution.