Questions tagged [cmusphinx]

An open-source toolkit for speech recognition. Includes speech recognizers Sphinx 2-4, PocketSphinx and a set of tools to supplement the recognizers.

CMU Sphinx is a group of speech recognition systems developed at Carnegie Mellon University.

682 questions
0
votes
0 answers

Sphinx3 compilation error

I am trying to use Sphinx for the first time. I'm using Sphinx3 because I need s3_aligner. I followed many tutorials, but none of them worked; I always get this error: **configure: error: in…
Bassem
  • 43
  • 2
  • 9
0
votes
2 answers

Bw tool in sphinxtrain cannot open mdef file on Windows

I am developing a speech recognition application using Sphinx4. I am trying to do acoustic model adaptation. I have followed every step of the instructions in the http://cmusphinx.sourceforge.net/wiki/tutorialadapt tutorial. However, in the step…
Le Hoang Long
  • 428
  • 3
  • 10
0
votes
1 answer

Run pocketsphinx_continuous with a keyphrase

I am trying to use a keyphrase with pocketsphinx, but it keeps throwing this error: ERROR: "kws_search.c", line 171: The word 'hey' is missing in the dictionary. This happens even though the word is definitely in the dictionary. It is a big part of the dictionary and it…
Mitchell Ingram
  • 686
  • 2
  • 11
  • 23
0
votes
2 answers

Streaming a file off a server

I am trying to stream the recorded audio from my Raspberry Pis to my desktop computer, which handles pocketsphinx phenomenally. I can pipe the audio using: arecord -D plughw:1,0 -r 16000 -f S16_LE | ssh -C user@192.168.86.101 sox - test.wav And…
Mitchell Ingram
  • 686
  • 2
  • 11
  • 23
0
votes
2 answers

UnsupportedClassVersionError while using CMU Sphinx 4 with maven project

I am implementing speech-to-text using Sphinx4, and my code is below: public class TranscriberDemo { public static void main(String[] args) throws Exception { Configuration configuration = new Configuration(); …
Jennifer
  • 351
  • 6
  • 18
0
votes
1 answer

How to get a CTM File from CMU Sphinx?

I have decoded my speech into phonemes using my language model. I need to segment the speech into sentences and words. I know a CTM file will help. Can anyone tell me how to generate the CTM file for a particular speech instance?
0
votes
2 answers

Sphinx4 5alpha recognizer.getResult() null point exception

I built a new language model, acoustic model, and dictionary. When I try to recognize speech from a WAV file, I get: Exception in thread "main" java.lang.NullPointerException at edu.cmu.sphinx.result.Lattice.getNodeID Here is my code: try { …
0
votes
1 answer

Sphinx4 Live Voice Recognition Only Works Once

I've been trying all day to get the Java Sphinx4 API to recognize my speech in real time, and it does. But it only does it once, after the program is opened. After it gets that first input, the program just freezes at recognizer.recognize();, even…
TheBeocro
  • 121
  • 1
  • 1
  • 8
0
votes
2 answers

how to manage hello.gram for conversation using sphinx and freetts in java

I am developing a small voice-based interaction tool using Sphinx (speech to text) and FreeTTS (text to speech) in Java. For example, FreeTTS gives a voice prompt such as "Name", and the user replies with his name, age, and place. Everything is static. How to write…
Spartan
  • 3,213
  • 6
  • 26
  • 31
0
votes
2 answers

how big hello.gram file be in sphinx java

I am new to Sphinx in Java and started with a small project on employee details. I downloaded the HelloWorld program and am modifying it. How can I get the employee ID (e.g. T5438, Y7651, U9085) and department (Auto, Support, Mobile) dynamically? How to declare below in…
Spartan
  • 3,213
  • 6
  • 26
  • 31
0
votes
1 answer

How to convert cmudict-0.7b or cmudict-0.7b.dict in to FST format to use it with phonetisaurus?

I am looking for a simple procedure to generate an FST (finite state transducer) from cmudict-0.7b or cmudict-0.7b.dict, which will be used with phonetisaurus. I tried the following set of commands (phonetisaurus Aligner, Google NGramLibrary and…
g10dras
  • 399
  • 2
  • 11
0
votes
1 answer

What is the ideal audio level for Sphinx?

On my system, using my USB microphone, I've found that the audio level that works best with CMU Sphinx is about 20% of the maximum. This gives me 75% voice recognition accuracy. If I amplify this digitally I get far worse recognition accuracy (25%).…
dingo
  • 1
0
votes
1 answer

Can I disable CMU Sphinx grammar processing?

I am noticing that Sphinx attempts to figure out the grammar of what I'm speaking. Frequently, it makes mistakes that involve inserting the word "the" or other articles. Is there any way to get it to just tell me the exact words I have spoken?
dingo
  • 1
0
votes
1 answer

How to disable noise cancellation in sphinx4-5prealpha?

I'm aware that static noise cancellation with spectral subtraction is enabled by default in the latest version, sphinx4-5prealpha, as stated in this topic: https://stackoverflow.com/a/29062128/5534684 But is there some way to disable this feature and…
0
votes
1 answer

Where to install CMU Sphinx adaption files?

I've done the various steps to adapt CMU Sphinx using some recorded WAV files. Now where do the resulting files go? I want to let each user on the system adapt CMU Sphinx to his own voice, so I don't want to store the files in /usr/local. Is there…
hello4000
  • 1
  • 2