I have a list A with strings:
['assembly eye tow top', 'tow eye bolts', 'tow eye bolts need me']
I am trying to find a string strA
that has the highest partial match score with all the strings in list A.
In other words, create a string that contains common tokens AND tokens that is present in most of the strings!
i.e. strA = 'tow eye bolts'
I tried the following:
- Common substring (would not work since string needs to be contigous and has to maintain order)
- Common subsequence (would not work since string needs to maintain order)
- Find collocations (I do not know how to implement this for the desired output)
- Use Python's fuzzywuzzy (I tried this but this only finds similarity scores between two strings)