I am trying to use phonetic algorithms like Soundex and/or Metaphone to generate words that sound similar to a given dictionary word. Do I have to have a corpus of all dictionary words for doing that? Is there another way to generate words that sound similar to a given word without using a corpus? I am trying to do it in Python.
1 Answer
2
If you don't use a corpus, you will probably have to manually define a set of rules that split a word into phonetic parts and then, for each part, list the phonemes that sound close to it. This can generate similar-sounding words, but most of them won't be real words. If you want similar-sounding words that actually exist, you need a corpus to check the candidates against.
You didn't specify the goal of your task, but you may be interested in the works of Will Leben "Sounder I" (and II and III) and Jabberwocky sentences.
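To make the difference concrete, here is a minimal sketch of both routes in plain Python. It assumes a hand-written American Soundex and uses Soundex's consonant classes as a crude stand-in for "close phonemes"; the function names (`soundex`, `variants`, `similar_words`) and the tiny `WORDS` list are made up for illustration, and a real word list would replace `WORDS`.

```python
# Soundex consonant classes: letters in the same class get the same digit.
CLASSES = ["bfpv", "cgjkqsxz", "dt", "l", "mn", "r"]
CODES = {ch: str(d) for d, letters in enumerate(CLASSES, start=1) for ch in letters}


def soundex(word):
    """Return the 4-character American Soundex code of a word."""
    word = "".join(ch for ch in word.lower() if ch.isalpha())
    if not word:
        return ""
    result = word[0].upper()
    prev = CODES.get(word[0], "")
    for ch in word[1:]:
        code = CODES.get(ch, "")
        # Emit a digit only when it differs from the previous letter's code.
        if code and code != prev:
            result += code
        # 'h' and 'w' do not separate equal codes; vowels do.
        if ch not in "hw":
            prev = code
    return (result + "000")[:4]


def variants(word):
    """No corpus: swap each consonant for another one in its class.

    Most results are not real words.
    """
    out = set()
    for i, ch in enumerate(word.lower()):
        for group in CLASSES:
            if ch in group:
                for alt in group:
                    if alt != ch:
                        out.add(word[:i].lower() + alt + word[i + 1:].lower())
    return sorted(out)


def similar_words(word, corpus):
    """With a corpus: keep only real words that share the Soundex code."""
    target = soundex(word)
    return [w for w in corpus if w.lower() != word.lower() and soundex(w) == target]


# Hypothetical stand-in for a real dictionary or corpus.
WORDS = ["robert", "rupert", "rabbit", "robot"]

print(soundex("Robert"))                # R163
print(variants("cat"))                  # includes 'kat', 'cad', 'xat', ...
print(similar_words("Robert", WORDS))   # ['rupert']
```

The rule-based `variants` needs no data but mostly produces non-words, while `similar_words` returns only real words and is limited to whatever the word list contains.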

gaborous
I will most likely end up using the Oxford University Press corpus. I have submitted a request to them. The articles that you mentioned are super interesting. Thanks for your time! – user2832492 Jun 23 '15 at 18:22