Questions tagged [sphinx4]

Sphinx-4 is a part of CMUSphinx Speech Recognition Toolkit. It's a flexible large and small vocabulary speech decoder written in Java and licensed under BSD license.

This tag is about Sphinx-4, a speech recognition decoder. Speech recognition is fastly growing domain and it's quite complex by its nature. The development of the speech recognition application requires understanding of the speech recognition specifics - a probabilistic nature of the results, the need for throughout testing, the specifics of the voice user interface design, the accuracy/speed balance.

The main concept you need to be aware about are acoustic model used to capture the sounds of the language, the language model used to capture the vocabulary and the dictionary which maps from words to sounds. The use of Sphinx-4 in your application is often straightforward but you need to be more careful than usually to get everything in place.

To learn more about CMUSphinx and Sphinx-4 visit CMUSphinx page

https://cmusphinx.github.io/wiki/

Read the tutorial

https://cmusphinx.github.io/wiki/tutorial/

255 questions
0
votes
1 answer

Make Sphinx4 Recognize all the numbers using custom .gram file

Description A speech recognizer calculator in Java Using Sphinx4 library exists. The full code on github: here The gram file i am using is the below(on github): #JSGF V1.0; /** * JSGF Grammar */ grammar grammar; public = (one |…
GOXR3PLUS
  • 6,877
  • 9
  • 44
  • 93
0
votes
1 answer

Full sentence voice recognition using sphinx

I am new to both java and sphinx4 Here i have downloaded sphinx and i am using eclipse editor so i added the jar files and my set up is ready Infact i also run the demo hello world example which was giving the output as expected . But Here in…
Shikha thakur
  • 1,269
  • 13
  • 34
0
votes
1 answer

What should I use between CMU Pocketsphinx and CMU Sphinx4 to get subtitles from video files?

I would like to extract subtitles from video files eventually. Current video files are located on physical disk, so they will be considered as train/test data. But imagine, that I have running web-app where I upload the fresh video and my web-app…
Novitoll
  • 820
  • 1
  • 9
  • 22
0
votes
0 answers

Property exception component:'jsgfGrammar'

So I have been trying to start writing my own code for Sphinx4. I created my own grammar, config, and java file, all necessary for it to run, and used the code from the HelloWorld demo. However, when I ran the code, I received this error: Problem…
0
votes
0 answers

Empty result from the recognizer

I am trying to use the CMU Sphinx speech recognizer to recognize some speech files I record in WPF: Here is the sample code I compiled: package com.example; import java.io.File; import java.io.FileInputStream; import java.io.InputStream; import…
Andrew Au
  • 812
  • 7
  • 18
0
votes
1 answer

Acoustic model compatibility issue

I am using sphinx4-5prealpha to do text-audio alignment. I use the Acoustic model and dictionary packed with this version. For some reason I want to use the same AM and acoustic model with an old sphinx-4 version but I get:…
user1828433
  • 252
  • 2
  • 11
0
votes
1 answer

Decide cluster size for speaker adaptation in Sphinx-4

In CMU Sphinx(Sphinx-4) for speaker adaptation technique, I am using following code snippet Stats stats = recognizer.createStats(nrOfClusters); recognizer.startRecognition(stream); while ((result = recognizer.getResult()) != null) { …
rishi007bansod
  • 1,283
  • 2
  • 19
  • 45
0
votes
1 answer

Sphinx4 Token units shifted

I am using Sphinx4 to do alignment with a text. I want to get the timing for each word in the sentence (start, end) and also the timing of each phoneme in the words. To do this I changed the code of the SpeechAligner. The method I edited is:…
user1828433
  • 252
  • 2
  • 11
0
votes
0 answers

Looking to create a Voice Recognition to Text using Java

I'm not sure how to ask this...but! I'm working on a voice recognition project. The app would be built as a phone bot... so scripted questions with yes or no replays.Ever thing shows as text in console. Q: So I know sphinx-4 has a good library but…
LawRose
  • 21
0
votes
1 answer

Changing config.xml in Sphinx4

I am running Transcriber demo in sphinx4 samples. But the accuracy is low. In Transcriber sources one of the html file says that this demo is for digits and to transcribe non digit data modify the config.xml. It says that "Please refer to the…
0
votes
1 answer

How to use adapted acoustic model in sphinx4

On sphinx documentation it's written that adaptation of inbuilt acoustic model is same in both sphinx4 and pocket sphinx and there is an another documentation which is for sphinx4 adaptation. But after adaptation how to transcribe the audio file? In…
0
votes
1 answer

Disable INFO messages from Sphinx4

How do I disable red INFO messages when running stream or live recognizer? I am using Configuration, not ConfigurationManager no I can't modify default.config.xml
Tahnik Mustasin
  • 2,216
  • 2
  • 18
  • 21
0
votes
1 answer

Speech recognition Sphinx4 Hindi

I am trying to work on Hindi speech recognition. I am getting the error in setting paths of models. I am setting them like shown below. Where am I going wrong? configuration …
Maddy
  • 1
0
votes
1 answer

How to create Dictionary file from vocab?

How to create dictionary(.dict) file for our specific domain Language model. I'm using CMU tool kit to create ARPA format Language model, but in that there is no option to create .dict file. Thanks in advance.
Sumitraj Patil
  • 107
  • 1
  • 1
  • 11
0
votes
1 answer

How to add own language model to java program using sphinx 4

I've successfully created the language model and other files using sphinxbase, sphinxtrain etc. I've now following list of files an4.dic an4.lm an4.lm.bin feat.params mdef means mixture_wights noisedict transition_matrices variances Now i ve to…