Questions tagged [cmusphinx]

An open-source toolkit for speech recognition. Includes speech recognizers Sphinx 2-4, PocketSphinx and a set of tools to supplement the recognizers.

CMU Sphinx is a group of speech recognition systems developed at Carnegie Mellon University.

Links:

682 questions
5
votes
1 answer

How to improve the accuracy for speech-to-text conversion using recognize_sphinx API in Python

How can we improve the accuracy of speech to text conversion using recognize_sphinx API in Python? Please find the below code, which needs to improve the accuracy base! import speech_recognition as sr # Obtain path to "english.wav" in the same…
5
votes
5 answers

Installing Pocketsphinx on Android

I have some questions on installing Pocketsphinx on and Android phone, and I haven't been able to find answers on CMUSphinx's support forum, IRC, existing StackOverflow posts, or intensive Googling. If anyone here has a minute and would be willing…
Zach Rattner
  • 20,745
  • 9
  • 59
  • 82
5
votes
2 answers

Pocketsphinx: Capturing real-time output of -inmic yes to .txt

I'm using pocketsphinx_continuous on Windows. Redirecting output to a text file works with the "-infile" argument, but fails with "-inmic yes". As noted in the question Does pocketsphinx flush stdout? pocketsphinx ignores stdout (at least when using…
sjw
  • 228
  • 1
  • 4
  • 8
5
votes
1 answer

Build NEW Acoustic model, Dictionary , Language model for uncommon language speech recognition

I want to build NEW Acoustic model ,New Dictionary ,New Language model for "Sinhala Language speech recognition" Sinhala language Characters are Unicode based. for an example A=අ,I=ඉ,U=උ,KA=ක,BA=බ. I did go through CMUSphinx Tutorial For…
5
votes
1 answer

Sphinx4 fails to find resources

I have a problem. You see, I'm creating this program, and I want voice recognition. CMUSphinx 4 seemed like the best option, so I just downloaded the jars and added them to my project as a compacted library. So I got this code from the CMUSphinx…
Max K
  • 65
  • 7
5
votes
1 answer

How can we convert .wav file to text by using pocketsphinx?

I installed pocketsphinx in my Linux machine correctly, and now I want to convert an audio file (.wave) to text by using pocketsphinx,how can i do that? is there any clear command and short command to do that? somthing like this command: …
5
votes
1 answer

AndroidPocketSphinx: build path contains duplicate entry: 'src' for project

I am trying to create a project (on eclipse ADT) from an existing project already unzipped into the workplace. I am basically following the instructions in: http://cmusphinx.sourceforge.net/2011/05/building-pocketsphinx-on-android/ But as soon as it…
AndroSphinx
  • 131
  • 1
  • 6
5
votes
1 answer

PocketSphinx python and setting the acoustic model?

I was reading this guide on speech recognition, and it mentioned that I need three items for speech recognition: Acoustic model, Language Model, Phonetic Dictionary. I wanted to start playing with this python demo, which uses Gstreamer to capture…
user1812844
  • 297
  • 2
  • 10
5
votes
1 answer

pocketsphinx how to determine out of grammar words

I am currently using pocketsphix demo (android and visual studio 2010) and I have configured a jsgf grammer like this #JSGF V1.0; grammar Names; public = muhammad | ahmed | maria | john | kelley | peter | jacob | jason; Whenever I…
Ahmed
  • 14,503
  • 22
  • 92
  • 150
5
votes
1 answer

Configuring Pocketsphinx properly

I have been installing Pocketsphinx0.7 on a VM running Debian Squeeze. This worked fine and I can try to recognize speech from files.Having this I've built some python scripts which recognize a bunch of files I got and then estimating the word error…
elramino
  • 53
  • 1
  • 6
4
votes
1 answer

Pack files assets into apk?

I have some files that need to use by my application when it is on the device. Right now, I can only done this by copy those files and paste it into device directly by using computer manually. So is there any way to pack or copy these files into apk…
SaintTail
  • 6,160
  • 4
  • 31
  • 51
4
votes
2 answers

Use CMU's sphinx4 to transcribe non-digits data

I am recently working on using CMU's sphinx4 for transcription and eventually forced alignment, i.e. aligning audio with its transcript. I found a project called AutoCap that basically did what I wanted to develop. So, I installed it but it did not…
4
votes
2 answers

TLSphinx cmusphinx pocketsphinx Hypothesis result text empty string score negative (-) number

I ran the sample code in the readme file at tryolabs/TLSphinx README.md, and the result of the text property of the Hypothesis is whitespace, while the score property is a negative number of -4420. Why am I not getting good results in the text…
daniel
  • 1,446
  • 3
  • 29
  • 65
4
votes
2 answers

Problems trying to build PocketSphinxAndroidDemo using NDK

I am trying to compile PocketSphinxAndroidDemo, which provides an example implementation of the CMU pocketsphinx speech recognizer on Android. I first received an error similar to the discussion here. After executing ndk-build, I got this…
Nick Ruiz
  • 1,405
  • 4
  • 18
  • 28
4
votes
1 answer

mllr_solve 'Error reading byte order magic number' on Windows

I'm following the Adapting Default Acoustic model tutorial and have hit a error running mllr_solve. INFO: main.c(382): -- 1. Read input mean, (var) and accumulation. ERROR: "pio.c", line 434: fread() failed; retrying... : No error ERROR: "swap.c",…
Ben
  • 10,931
  • 9
  • 38
  • 47
1 2
3
45 46