I've been asked to guess the user intention when part of expected data is missing. For example if I'm looking to get very well
or not very well
but I get only not
instead, then I should flag it as not very well
.
The Levenshtein distance for not
and very well
is 9
and the distance for not
and not very well
is 10
. I think I'm actually trying to drive a screw with a wrench, but we have already agreed in our team to use Levenshtein for this case.
As you have seen the problem above, is there anyway if I can make some sense out of it by changing the insertion, replacement and deletion costs?
P.S. I'm not looking for a hack for this particular example. I want something that generally works as expected and outputs a better result in these cases also.