Questions tagged [cmusphinx]

An open-source toolkit for speech recognition. Includes speech recognizers Sphinx 2-4, PocketSphinx and a set of tools to supplement the recognizers.

CMU Sphinx is a group of speech recognition systems developed at Carnegie Mellon University.

Links:

682 questions
8
votes
2 answers

How do I build a large-vocabulary language model for CMU Sphinx?

I would like to build a language model for CMU Sphinx, but my corpus has more than 1000 words so I cannot use the online tool. How do I use (the scripts in cmuclmtk?) to build my language model?
joeforker
  • 40,459
  • 37
  • 151
  • 246
8
votes
4 answers

Can't access microphone while running Dialog demo in sphinx4 5prealpha

I am trying to run the dialog demo of sphinx 4 pre aplha but it gives errors. I am creating a live speech application. I imported the project using maven and followed this guide on stack overflow: https://stackoverflow.com/a/25963020/2653162 The…
abc
  • 117
  • 2
  • 11
8
votes
1 answer

How to give an input wav file to pocket sphinx

Is there some command line utility of pocket sphinx or cmu sphinx to convert a .wav file to text? pocketsphinx_continuous -hmm -lm -dict will do. But I don't want to keep speaking the same sentence again and again.
nizam.sp
  • 4,002
  • 5
  • 39
  • 63
8
votes
1 answer

Python having trouble accessing usb microphone using Gstreamer to perform speech recognition with Pocketsphinx on a Raspberry Pi

So python is acting like acting like it can't hear ANYTHING from my microphone at all. Here's the problem. I have a Python ( 2.7 ) script that is suppose to be using Gstreamer to access my microphone and do speech recognition for me via…
Malcolm Jones
  • 1,472
  • 1
  • 12
  • 24
8
votes
1 answer

Using JSGF grammar instead of DMP language model (use -jsgf instead of -lm) in CMU Sphinx (pocketsphinx)

I need to train CMU Sphinx offline in my android app. I downloaded this project that use this parameter but when I start it I have this error: Fatal signal 11 (SIGSEGV) at 0x0000001c (code=1). This error I also have when Sphinx can not find the…
Sergey Pekar
  • 8,555
  • 7
  • 47
  • 54
8
votes
1 answer

Convert audio files for CMU Sphinx 4 input

I have a big batch of files I'd like to run recognition on using CMU Sphinx 4. Sphinx requires the following format: 16 khz 16 bit mono little-endian My files are something like 44100 khz, 32 bit stereo mp3 files. I tried using Tritonus, and then…
Nate Glenn
  • 6,455
  • 8
  • 52
  • 95
7
votes
1 answer

How to split speech data on frames and compute MFCC

I understand the basic steps of creating an automated speech recognition engine. However, I need a clear-er idea of how segmentation is done and what are frames and samples. I will write down what I know and expect the answer-er to correct me in the…
cipher
  • 2,414
  • 4
  • 30
  • 54
7
votes
3 answers

Live recognition with Python and Pocketsphinx

I have recently been working with pocket sphinx in python. I have successfully got the example below to work recognising a recorded wav. #!/usr/bin/env python import sys,os def decodeSpeech(hmmd,lmdir,dictp,wavfile): """ Decodes a…
Tonderai Ratisai
  • 533
  • 4
  • 8
  • 17
7
votes
3 answers

How to use CMU Sphinx speech recognition with Ruby application?

I'm searching for how to use CMU Sphinx with Ruby (Rails) application. I need very simple task - I have an mp3 file and I want get it transcribed into text. How can I implement this easiest way? I dont' know C/C++ and my task isn't so big to learn…
Alve
  • 1,315
  • 2
  • 17
  • 16
6
votes
1 answer

Python Speech Compare

I have two .wav files that I need to compare and decide if they contain the same words (same order too). I have been searching for the best method for a while now. I can't figure out how to have pyspeech use a file as input. I've tried getting the…
Kreuzade
  • 757
  • 5
  • 11
  • 22
6
votes
1 answer

Dictation Application using Sphinx4

My requirements are similar to this question since the question is now 3 years old I am re-posting the question with information specific to mine, I want to create an application which takes a .wav (or any other standard audio file format) and…
Amit
  • 13,134
  • 17
  • 77
  • 148
6
votes
1 answer

CMU Sphinx for Voice/Speaker Recognition

I'm looking for a way to match a known data set, let's say a list of MP3s or wav files, each which is a sample of someone speaking. At this point I know file ABC is of Person X speaking. I would then like to take another sample, and do some voice…
6
votes
2 answers

Is CMU Sphinx available via Maven?

I have an idea for an app that may need CMU Sphinx, is it available via maven, or do I need to add it by hand?
Levi Campbell
  • 6,005
  • 7
  • 40
  • 47
6
votes
3 answers

Speech recognition for android

I want to speech recognition for burmese language. I refer following links…
kalpana c
  • 2,739
  • 3
  • 27
  • 47
5
votes
2 answers

German pronunciation dictionary for CMU Sphinx

I am looking for a German pronunciation dictionary in order to use for PocketSphinx / CMU Sphinx. The dictionary looks like this: BACK B AE K CALL K AO L NO N OW NUMBER N AH M B ER ONE W AH N ONE(2) HH W AH N REPLY R IH P L AY REPLY(2) …
Force
  • 6,312
  • 7
  • 54
  • 85
1
2
3
45 46