Audio language identification

Question

I'm looking for an open source library to detect the spoken language used in an audio file, such as a wav file.

I tried CMU sphinx , but was not able to understand how to use it for language detection? Can someone please help?

score 2 · Answer 1 · answered Oct 17 '14 at 01:32

If you are willing to learn another toolkit, you should consider Kaldi [1]. It is an open-source speech recognition toolkit with a speaker recognition system (which uses similar models as a language identification system) in the trunk and an experimental language Identification setup in the sandbox language_id. After checking out the repository, you can switch to the LID sandbox with svn switch ^/sandbox/language_id. The LID examples are in egs/lre07.

Whichever toolkit you use, I recommend an i-Vector based system instead of a phonotactic system. An i-Vector based system will be easier to setup as it doesn't require transcripts, and significantly faster, since it avoids decoding.

score 1 · Answer 2 · answered Jun 11 '14 at 18:06

1

You can try CMU sphinx in all phone decode mode. Train models for languages you wish to identify. Pick language for which hypothesis score is best

answered Jun 11 '14 at 18:06

PGupta

205
3
13

Audio language identification

2 Answers2