Mozilla DeepSpeech is a TensorFlow implementation of Baidu's DeepSpeech architecture.
Questions tagged [mozilla-deepspeech]
103 questions
0
votes
1 answer
Convert cURL command to python using requests
I am using deepspeech and deespeech-server. I am able to send the cURL command:
curl -X POST --data-binary @what_time_is_it.wav http://localhost:8080/stt
This gives me the correct speech to text translation "what time is it".
I am now trying to…

O.Awoniyi
- 13
- 5
0
votes
1 answer
Pip unable to find deepspeech / deepspeech-gpu from versions on Windows
I'm trying to install deepspeech from Pypi on windows, I keep running into the issue of "Could not find required package from versions", my pip is up-to-date though ( 19.0.1).
I already tried some solutions given here:…

Amith Adiraju
- 306
- 4
- 18
0
votes
1 answer
Forced alignment using Aeneas with multible aeneas text files
We have started a project to create a Turkish speech recognition dataset to use with DeepSpeech.
We finished preprocessing task of Ebook.
But we couldn't finish the forced alignment process with Aeneas.
According to its tutorials for forced…

Mustafa Sarıalp
- 13
- 1
- 7
0
votes
1 answer
Why is Mozilla Deepspeech using Tensorflow 0.11 when I have 0.12 installed?
I use Anaconda3 with python 3.6 and use pip install tensorflow deepspeech.
When I run the following command I get errors:
deepspeech --model models/output_graph.pb --alphabet models/alphabet.txt --audio voice.wav
Loading model from file…

Rolando
- 58,640
- 98
- 266
- 407
0
votes
1 answer
How many epochs typically required for a decent training of mozilla deep-speech?
I'd like to know, How many epochs you typically run for training mozilla deep speech, with its default architecture. (no changes from GitHub version). What would be the decent number of epochs to run, especially considering it takes quite a bit of…

Rajesh D
- 311
- 1
- 2
- 12
0
votes
0 answers
deepspeech training audio file length
Is it compulsory to have training and inferring audio file length equal to 5 seconds?
I have this questions because I have a large amount of training data with audio(every audio more than 30 seconds) and respective transcripts. If I can’t use this…

megha
- 115
- 1
- 2
- 10
0
votes
1 answer
Increasing --train_batch_size 2 to --train_batch_size 3 causes Mozilla DeepSpeech not to train anymore. Why?
Increasing --train_batch_size 2 to --train_batch_size 3 causes Mozilla DeepSpeech not to train anymore. What could explain this?
Specifically, if I run
./DeepSpeech.py --train_files data/common-voice-v1/cv-valid-train.csv --dev_files \
…

Franck Dernoncourt
- 77,520
- 72
- 342
- 501
0
votes
1 answer
What does the "Test of Epoch [number]" mean in Mozilla DeepSpeech?
What does the "Test of Epoch [number]" mean in Mozilla DeepSpeech?
In the following example, it says Test of Epoch 77263, even though there should be just 1 epoch from my understanding, since I gave --display_step 1 --limit_train 1 --limit_dev 1 …

Franck Dernoncourt
- 77,520
- 72
- 342
- 501
0
votes
3 answers
Deepspeech - inferring for more audio files and saving the output
I am done with my training on common voice data for deepspeech from Mozilla and now I am able to get output for a single audio .wav file. Below is the command I am using.
(deepspeech-venv)…

megha
- 115
- 1
- 2
- 10
0
votes
2 answers
Command not found error for deepspeech if installed with user flag
I installed deepspeech in ubuntu 16.04 using
pip install deepspeech --user
but when I use deepspeech deepspeech -h on cli it says command not found.
I have tried both pip and pip3 for installation, also tried restarting but it still says command…

Prem Kumar
- 27
- 2
- 8
0
votes
0 answers
Unable to install swig and swig decoders in Redhat 7 or ubuntu
I have installed the swig and swig decoder package from the below methods:
Swig:
yum install swig
Swig Decoders:
Clone https://github.com/PaddlePaddle/DeepSpeech/
Cd into decoders/swig/ and run ./setup.sh
Problem:
When I am importing the swig…

Rituraj kumar
- 349
- 1
- 4
- 15
-1
votes
1 answer
Python speech recognition and deepspeech
I'm trying to develop an offline voice assistant using python. I want to use speech recognition module by streaming using microphone and deepspeech. I didn't find something similar like this in the internet. How to do this?

Echu
- 1
-1
votes
2 answers
python Deepspeech TypeError: __init__() takes 2 positional arguments but 3 were given
I'm trying to implement deepspeech and as part of the implementation, I'm trying to run the below code. I have installed python3 and deepspeech-0.9.3
While running the below code, I'm getting an error as line 17, in model =…

Deepak Bandi
- 1,854
- 4
- 21
- 37