First of all I'd like to state that my question is not per say about the "classic" definition of voice recognition.
What we are trying to do is somewhat different, in the sense of:
- User records his command
- Later, when the user will speak pre-recorded command, a certain action will occur.
For example, I record a voice command for calling my mom, so I click on her and say "Mom". Then when I use the program and say "Mom", it will automatically call her.
How would I perform the comparison of a spoken command to a saved voice sample?
EDIT: We have no need for any "text-to-speech" abilities, solely a comparison of sound signals. Obviously we're looking for some sort of a off-the-shelf product or framework.