0

I would like to create an automatic synonym finder algorithm (mostly for brand names). For example, if the user enters the word "Coca cola", I would like to return the word "Coke".

This can easily be done using dictoniaries; but than I need a dictonary file for that. How can I automaticly create one? (I have a small corpus with 1 million+ records, the synonyms will come from this).

Are there any text-mining algorithms for this problems? Also, there are some online synonym services for this, but these contain no brand names :-(

Thanks, krisy

krisy
  • 1,508
  • 1
  • 14
  • 32

1 Answers1

0

A trie based algorithm can find synonyms or similar words. It's not using keys but position in the tree to define the key. For example from A you go to B and from B you go to C. There are many data structures for a trie. The most simple is an adjaceny list or a ternary search tree.

Micromega
  • 12,486
  • 7
  • 35
  • 72