Questions tagged [sphinx4]

Sphinx-4 is part of the CMUSphinx speech recognition toolkit. It is a flexible speech decoder for both large and small vocabularies, written in Java and released under a BSD license.

This tag is about Sphinx-4, a speech recognition decoder. Speech recognition is a fast-growing domain and is quite complex by nature. Developing a speech recognition application requires an understanding of its specifics: the probabilistic nature of the results, the need for thorough testing, the particulars of voice user interface design, and the trade-off between accuracy and speed.

The main concepts you need to be aware of are the acoustic model, which captures the sounds of the language; the language model, which captures the vocabulary; and the dictionary, which maps words to sounds. Using Sphinx-4 in your application is often straightforward, but you need to be more careful than usual to get everything in place.
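
To make the dictionary's role concrete, here is a minimal sketch in plain Java. It does not use the Sphinx-4 API. It assumes a simple text format with one entry per line, a word followed by its phones, in the style of the CMU pronouncing dictionary. The class name, method names, and sample entries are hypothetical and chosen only for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Toy illustration only: not part of Sphinx-4.
class PronunciationDictionarySketch {

    // Parses lines of the assumed form "word PH1 PH2 ..." into a word -> phones map.
    static Map<String, List<String>> parse(String text) {
        Map<String, List<String>> entries = new LinkedHashMap<>();
        for (String line : text.split("\n")) {
            String trimmed = line.trim();
            if (trimmed.isEmpty()) {
                continue; // skip blank lines
            }
            String[] fields = trimmed.split("\\s+");
            if (fields.length < 2) {
                continue; // a word without phones cannot be used
            }
            List<String> phones = Arrays.asList(Arrays.copyOfRange(fields, 1, fields.length));
            entries.put(fields[0], phones);
        }
        return entries;
    }

    // Returns the words of a sentence that have no entry in the dictionary.
    static List<String> missingWords(Map<String, List<String>> dictionary, String sentence) {
        List<String> missing = new ArrayList<>();
        for (String word : sentence.trim().toLowerCase().split("\\s+")) {
            if (!word.isEmpty() && !dictionary.containsKey(word)) {
                missing.add(word);
            }
        }
        return missing;
    }

    public static void main(String[] args) {
        // Sample entries are illustrative, not copied from a shipped dictionary file.
        Map<String, List<String>> dictionary = parse("hello HH AH L OW\nworld W ER L D\n");
        System.out.println(dictionary.get("world"));
        System.out.println(missingWords(dictionary, "Hello brave world"));
    }
}
```

A check like `missingWords` can be a useful first step when recognition results look wrong: a word that is absent from the dictionary has no sound sequence for the decoder to match, so it is worth confirming that the vocabulary you expect is covered.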

To learn more about CMUSphinx and Sphinx-4, visit the CMUSphinx page:

https://cmusphinx.github.io/wiki/

Read the tutorial:

https://cmusphinx.github.io/wiki/tutorial/

255 questions
1 vote, 1 answer

How to get more than one speech results in sphinx 4 using SpeechResult api?

result.getHypothesis() doesn't always give me the best result, so I want multiple results after recognition. I tried the code below, but it returns an empty array. Recognizer is an object of LiveSpeechRecognizer. SpeechResult result =…
parth karia
1 vote, 0 answers

"Property exception component:'wordPruningSearchManager' property:'logMath' - mandatory property is not set" in CMU-Sphinx

I am getting this error when I try to run the code below with a CMU-Sphinx acoustic model. The exception is: Exception in thread "main" Property exception component:'wordPruningSearchManager' property:'logMath' - mandatory property is not set! Full…
Keval
1 vote, 1 answer

Sphinx4 Demo Applications Do Not Work

I am doing a project in Java that requires me to use speech recognition software. Basically, I am programming a "social robot", a robot that you can interact with like a person. I am using the "Zeno R25". With that being said, speech recognition…
1 vote, 1 answer

Why is my Sphinx4 Recognition poor?

I am learning how to use Sphinx4 using the Maven plug-in for Eclipse. I took the transcribe demo found on GitHub and altered it to process a file of my own. The audio file is 16bit, mono, 16khz. It is approximately 13 seconds long. I noticed that…
1 vote, 2 answers

Sphinx4 breaks on AWS Elastic Beanstalk, works on dev machine

I created a dummy Maven project that wraps CMU Sphinx4 in a servlet. The dependencies are sphinx4-core:1.0-SNAPSHOT and sphinx4-data:1.0-SNAPSHOT. When running locally (Eclipse) it works great, but then I deploy it to AWS Elastic Beanstalk and I get…
1 vote, 1 answer

sphinx-4 aligner skips plain words like `you`, `in` and words with dashes - why?

I'm trying to align simple text. Here are the links to text and audio files: http://s000.tinyupload.com/?file_id=48044768133759453374 http://s000.tinyupload.com/?file_id=99891199139563396901 Here is the configuration settings: private static final…
Max Koretskyi
1 vote, 1 answer

Sphinx4: How can improve the accuracy of recognizing wav file in dialog demo

I have edited the dialog code to make it work for my project. I have created a text file with some of the possible sentences to be used in my work. I added the link in the comment section. I have followed the steps on…
1 vote, 1 answer

Failed to load en-us-semi model in sphinx4

I've recently been tasked with rewriting a C server in Java, which has meant migrating its speech recognition capabilities from the Pocketsphinx C api over to the Sphinx4 Java API, using the same dictionary and language model files as those that…
MawrCoffeePls
1 vote, 3 answers

Run sphinx4 recognizer in background with UI

I am trying to make my existing console based Java program which is a speech recognition project using sphinx, into a GUI based program. I tried altering the code, but the existing program does not run if I alter it. My existing code for recognition…
Robin
1 vote, 1 answer

Sphinx4 figuring out correct models

I am trying to use the Sphinx4 library for speech recognition, but I cannot seem to figure out the correct combination of acoustic model-dictionary-language model. I have tried out various combinations and I get a different error every time. I am…
mori
1 vote, 1 answer

Default config.xml?

On this page of the documentation, at the bottom, it says: You can find full examples of Sphinx-4 configuration file in sources. For example, check the file sphinx4/src/apps/edu/cmu/sphinx/demo/transcriber/config.xml Well, I looked, and there is…
user773737
1 vote, 1 answer

Retrieval from the database with Sphinx4

Now I'm in the process of making a dictionary application using voice. I have made this dictionary and there are about 100000 words as database. This dictionary needs to be searched by voice. For this, I use Sphinx4 / cmusphinx as a tool to be used.…
davinma06
1 vote, 0 answers

JAVA - Failed to iterate sphinx4 within swing component

I am using CMU Sphinx. When I run my program, this error appears: "Can't open microphone line with format PCM_SIGNED 16000.0 Hz, 16 bit, mono, 2 bytes/frame, big-endian not supported." Cannot start microphone. I want my program to listen…
davinma06
1 vote, 1 answer

Sphinx4 OutOfMemoryError with custom configuration

I want to create a program that recognizes speech from a .wav file. I tried the code below, but it throws an exception: Exception in thread "main" java.lang.OutOfMemoryError: GC overhead limit exceeded. This happens even though my "eclipse.ini" has this property:…
Dorin
1 vote, 1 answer

Large vocabulary speech recognition in sphinx4

As far as I know, sphinx4 requires a grammar to identify words. Is there any way to recognize input that is not covered by grammar rules, so that I can dictate and it will write down what I say?
Kakar