I am trying to measure the similarity between company names, but I am having difficulty matching abbreviations to the full names. For example:
IBM
The International Business Machines Corporation
I have tried using fuzzywuzzy to measure the similarity:
>>> from fuzzywuzzy import fuzz
>>> fuzz.partial_ratio("IBM","The International Business Machines Corporation")
33
>>> fuzz.partial_ratio("General Electric","GE Company")
20
>>> fuzz.partial_ratio("LTCG Holdings Corp","Long Term Care Group Inc")
39
>>> fuzz.partial_ratio("Young Innovations Inc","YI LLC")
33
Are there any techniques that would give a higher similarity score for abbreviations like these?
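For reference, here is a minimal sketch of one possible direction: an initials check, assuming an abbreviation is formed from the first letters of the words in the full name once legal-form words are dropped. The names `STOPWORDS`, `tokens`, `initials` and `abbreviation_match` are hypothetical helpers, not part of fuzzywuzzy, and the stopword list is only an illustrative guess.

```python
import re

# Hypothetical list of legal-form and filler words to ignore; extend as needed.
STOPWORDS = {"the", "inc", "llc", "corp", "corporation",
             "company", "co", "ltd", "holdings"}

def tokens(name):
    """Lowercase the name and return its words, minus stopwords."""
    words = re.findall(r"[a-z0-9]+", name.lower())
    return [w for w in words if w not in STOPWORDS]

def initials(name):
    """Concatenate the first letter of every remaining word."""
    return "".join(w[0] for w in tokens(name))

def abbreviation_match(a, b):
    """True if one name, stripped of stopwords, spells the other's initials."""
    ta, tb = tokens(a), tokens(b)
    if not ta or not tb:
        # Nothing left to compare after removing stopwords.
        return False
    return "".join(ta) == initials(b) or "".join(tb) == initials(a)

pairs = [
    ("IBM", "The International Business Machines Corporation"),
    ("General Electric", "GE Company"),
    ("LTCG Holdings Corp", "Long Term Care Group Inc"),
    ("Young Innovations Inc", "YI LLC"),
]
for a, b in pairs:
    print(a, "|", b, "|", abbreviation_match(a, b))
```

This only yields a yes/no signal and would miss abbreviations that are not pure initials, so it would presumably have to be combined with a fuzzy score rather than replace it.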