I want to ask about the meaning of the 3-state phone model in HMMs. The context is HMM theory as used in speech recognition systems, so the example concerns the acoustic modeling of speech sounds with an HMM.
I took this example picture from a journal paper: http://www.intechopen.com/source/html/41188/media/image8_w.jpg
Figure 1: 3-State HMM for the sound /s/
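To make the question concrete, here is a minimal sketch of how I currently read the figure. I am assuming a left-to-right topology in which each state has a self-loop and a transition to the next state, with an exit after S3. The probabilities and the names `STATES`, `TRANSITIONS`, `expected_frames` and `rows_are_valid` are made up by me for illustration and are not taken from the paper.

```python
# Hypothetical 3-state left-to-right HMM topology for one phone, e.g. /s/.
# All probabilities are invented for illustration; they are NOT from the paper.
STATES = ["S1", "S2", "S3"]

# TRANSITIONS[i][j] = P(next = j | current = i).
# Columns 0..2 are S1..S3; the last column is "exit the phone model".
TRANSITIONS = [
    [0.6, 0.4, 0.0, 0.0],  # S1: stay in S1, or move on to S2
    [0.0, 0.7, 0.3, 0.0],  # S2: stay in S2, or move on to S3
    [0.0, 0.0, 0.5, 0.5],  # S3: stay in S3, or leave the model
]


def rows_are_valid(matrix):
    """Each row must be a probability distribution (sum to 1)."""
    return all(abs(sum(row) - 1.0) < 1e-9 for row in matrix)


def expected_frames(self_loop_prob):
    """Mean number of frames spent in a state whose self-loop
    probability is self_loop_prob (geometric duration)."""
    return 1.0 / (1.0 - self_loop_prob)


# Expected time spent in each state, in frames, under the assumed numbers.
durations = {
    name: expected_frames(TRANSITIONS[i][i]) for i, name in enumerate(STATES)
}

if __name__ == "__main__":
    print("valid transition matrix:", rows_are_valid(TRANSITIONS))
    for name, frames in durations.items():
        print(f"{name}: about {frames:.2f} frames on average")
```

If this reading is right, the self-loops only control how long the model stays in each state, which is part of what I am unsure about in the questions below.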
So, my questions are:
- What is meant by "3-state"?
- What do S1, S2 and S3 actually mean? (I know they are states, but what do they represent?)
- How is the /s/ sound represented by these HMM states?
- Why 3? What happens if we have 4, 5 or more states?
- If /s/ is just a single, simple consonant sound, what do the states and transitions represent?
Does anyone have a simple explanation of this theory with an example (a graphic analogy)?
Thank you
Nick