We are planning to develop a system to convert speech obtained from phone calls to text using sphinx4.
The format of such phone calls is
Type: Audio
Codec: PCM ALAW
Channels: Mono
Sample Rate: 8kHz BitRate: 8 bits per second
In the tutorial it says
If you are using sound files with a sampling rate of 8KHz (telephone audio), you need to change some values in etc/sphinx_train.cfg
Are there any other changes to be done apart from this?
Is it possible to develop a system for bit rate of 8 bits/sec because in the tutorial it says
“It's critical to have audio files in a specific format. Sphinxtrain does support some variety of sample rates but by default it's configured to train from16khz 16bit mono files in MS WAV format.”