Questions tagged [cmusphinx]

An open-source toolkit for speech recognition. Includes speech recognizers Sphinx 2-4, PocketSphinx and a set of tools to supplement the recognizers.

CMU Sphinx is a group of speech recognition systems developed at Carnegie Mellon University.

682 questions
3
votes
2 answers

How to combine speech recognition and speaker diarization?

I am trying to combine speech recognition and speaker diarization techniques to identify how many speakers are present in a conversation and which speaker said what. For this I am using CMU Sphinx and LIUM Speaker Diarization. I am able to run…
Shekhar
  • 11,438
  • 36
  • 130
  • 186
3
votes
1 answer

How can I write a JSGF grammar file with a word unknown in advance?

How can I write a JSGF grammar file with a word that is not known in advance? For example, I want to match "Give me the chair please", "Give me the book please", etc. using a construction like public = Give me please; How can I do this?
sagus_helgy
  • 1,417
  • 1
  • 18
  • 30
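
One possible workaround, sketched under the assumption that the candidate words can be enumerated before recognition starts, is to generate the grammar text from a word list. The rule names `<command>` and `<object>`, the grammar name `give`, and the helper `build_jsgf` are all hypothetical and do not come from the question; each listed word would presumably also need an entry in the recognizer's pronunciation dictionary.

```python
def build_jsgf(objects):
    """Return a JSGF grammar string that lists each known object word.

    Hypothetical helper: the rule and grammar names are illustrative only.
    """
    if not objects:
        raise ValueError("at least one object word is required")
    # JSGF expresses a choice between words as alternatives separated by '|'.
    alternatives = " | ".join(objects)
    return (
        "#JSGF V1.0;\n"
        "grammar give;\n"
        "public <command> = give me the <object> please;\n"
        f"<object> = {alternatives};\n"
    )


if __name__ == "__main__":
    print(build_jsgf(["chair", "book"]))
```

Regenerating the file whenever the word list changes keeps the grammar itself fixed while the vocabulary varies.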
3
votes
1 answer

Continuous speech recognition pocketsphinx from microphone

How do I perform continuous speech recognition from a microphone using pocketsphinx? How do I use the GStreamer plugin API in C?
user1462684
  • 91
  • 2
  • 9
3
votes
1 answer

How to reliably reject out-of-grammar words in PocketSphinx

I am using PocketsphinxDemo successfully on Android. I have a very limited set of words to be recognized, say 3 different words. I currently get lots of false positives - while talking, Pocketsphinx recognizes one of my 3 words although they have…
Hokascha
  • 1,709
  • 1
  • 23
  • 41
3
votes
2 answers

Extending Sphinx to support a local language

Initially I was thinking of writing a speech recognition engine from scratch (with support for 50-100 words) to support my native language. However, after some research it has become clear that it is not possible to make a speech recognition engine even…
Win Coder
  • 6,628
  • 11
  • 54
  • 81
3
votes
1 answer

Tuning CMU Sphinx

I am trying to build a speech recognition app based on CMU Sphinx. I have created my own language model using the lmtool. But in order to improve the accuracy of recognition, I want to tune Sphinx. Are there any guidelines for choosing the…
Shishya
  • 1,069
  • 1
  • 14
  • 22
3
votes
1 answer

Low accuracy of Sphinx 4 keyword spotting function caused by converting wav to cepstrum

I am trying to build a keyword spotting system, and I chose this, a branch of Sphinx 4, as the foundation of my project. It works properly with WAV files (at least 70% with a single keyword). But to save time in transmitting files from client to…
nh3500
  • 33
  • 6
3
votes
2 answers

CMU Sphinx4 phoneme dictation

How can I configure sphinx4 to be able to detect only phonemes in a dictation? I've already read about partial results "You can control how often the result listener is fired by setting the configuration variable 'featureBlockSize' in the…
Marl
  • 1,492
  • 2
  • 22
  • 37
3
votes
2 answers

Open-source tool for gender-recognition using voice

Good evening, I'm working on a project with Kinect and I need to perform gender recognition using the voice information captured by the sensor's microphone. The recognition could also be text-based, i.e. the sentence said by the user could…
Seul Ement
  • 225
  • 2
  • 11
2
votes
2 answers

CMU-Sphinx : NullPointerException at recognizer.allocate()

I've been trying to set up CMU-Sphinx4 on my PC in order to make an application using speech recognition. I was able to set up most of the parts and configuration and am trying to run the HelloWorld application provided by Sphinx, but I am stuck with…
2
votes
1 answer

How to choose pocketSphinx's language model in Python

I just need a speech recognition library for Linux, so I chose CMU Sphinx and use pocketSphinx with Python. I tested that demo. The example runs well on my computer, but with poor accuracy. I know that I need a language model, and I have downloaded…
madper
  • 806
  • 1
  • 10
  • 25
2
votes
2 answers

Speech recognition response is poor in sphinx4

Currently we are investigating using sphinx4 for speech recognition. We are trying to achieve a good response for a dictation-type application. The input is a WAV file and we wish to transcribe it. I have looked into the LatticeDemo and…
Raveesh Sharma
  • 1,486
  • 5
  • 21
  • 38
2
votes
1 answer

How to use an ARPA file in voice recognition

I have created an ARPA file from a text file using the CMU SLM toolkit. Currently I don't know how to use the generated ARPA file in my project instead of the .lm and .dic files. If anyone knows about that, please let me know.
ravoorinandan
  • 783
  • 2
  • 10
  • 26
2
votes
1 answer

Installing Pocketsphinx on Windows 10

I am trying to install pocketsphinx on Windows 10. I have downloaded pocketsphinx and sphinxbase, built them both, and put sphinxbase.dll into the pocketsphinx bin. However, when I copy and paste the command…
pilars32
  • 41
  • 1
  • 7
2
votes
1 answer

Speech-to-text number capture

Is there a method for capturing spoken numbers using cmusphinx? Poking around in the cmudict-en-us.dict file, I find the following: forty F AO R T IY, forty-five F AO R T IY F AY V, fifty F IH F T IY, eighty EY T IY. Rather than having Sphinx…
portsample
  • 1,986
  • 4
  • 19
  • 35
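
A minimal sketch of one way to handle this after recognition, assuming the recognizer returns number words as plain text (for example "forty-five"): convert the hypothesis string to an integer in a separate step. The helper `words_to_number` and its word tables are hypothetical and not part of CMU Sphinx.

```python
# Hypothetical post-processing step: turn recognized number words into an int.
UNITS = {
    "zero": 0, "one": 1, "two": 2, "three": 3, "four": 4, "five": 5,
    "six": 6, "seven": 7, "eight": 8, "nine": 9, "ten": 10,
    "eleven": 11, "twelve": 12, "thirteen": 13, "fourteen": 14,
    "fifteen": 15, "sixteen": 16, "seventeen": 17, "eighteen": 18,
    "nineteen": 19,
}
TENS = {
    "twenty": 20, "thirty": 30, "forty": 40, "fifty": 50,
    "sixty": 60, "seventy": 70, "eighty": 80, "ninety": 90,
}


def words_to_number(text):
    """Convert e.g. 'forty-five' or 'one hundred and fifty' to an integer."""
    total = 0    # completed groups (thousands)
    current = 0  # value of the group being built
    # Treat hyphenated entries such as "forty-five" as two words.
    for word in text.lower().replace("-", " ").split():
        if word in UNITS:
            current += UNITS[word]
        elif word in TENS:
            current += TENS[word]
        elif word == "hundred":
            current = max(current, 1) * 100
        elif word == "thousand":
            total += max(current, 1) * 1000
            current = 0
        elif word == "and":
            continue
        else:
            raise ValueError(f"not a number word: {word!r}")
    return total + current


if __name__ == "__main__":
    print(words_to_number("forty-five"))
```

Keeping the conversion outside the recognizer means the dictionary only needs the basic number words rather than an entry for every possible value.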