Questions tagged [mozilla-deepspeech]

Mozilla DeepSpeech is a TensorFlow implementation of Baidu's DeepSpeech architecture.

Code: https://github.com/mozilla/DeepSpeech

103 questions
0
votes
1 answer

Convert cURL command to python using requests

I am using deepspeech and deespeech-server. I am able to send the cURL command: curl -X POST --data-binary @what_time_is_it.wav http://localhost:8080/stt This gives me the correct speech to text translation "what time is it". I am now trying to…
O.Awoniyi
  • 13
  • 5
0
votes
1 answer

Pip unable to find deepspeech / deepspeech-gpu from versions on Windows

I'm trying to install deepspeech from Pypi on windows, I keep running into the issue of "Could not find required package from versions", my pip is up-to-date though ( 19.0.1). I already tried some solutions given here:…
0
votes
1 answer

Forced alignment using Aeneas with multible aeneas text files

We have started a project to create a Turkish speech recognition dataset to use with DeepSpeech. We finished preprocessing task of Ebook. But we couldn't finish the forced alignment process with Aeneas. According to its tutorials for forced…
0
votes
1 answer

Why is Mozilla Deepspeech using Tensorflow 0.11 when I have 0.12 installed?

I use Anaconda3 with python 3.6 and use pip install tensorflow deepspeech. When I run the following command I get errors: deepspeech --model models/output_graph.pb --alphabet models/alphabet.txt --audio voice.wav Loading model from file…
Rolando
  • 58,640
  • 98
  • 266
  • 407
0
votes
1 answer

How many epochs typically required for a decent training of mozilla deep-speech?

I'd like to know, How many epochs you typically run for training mozilla deep speech, with its default architecture. (no changes from GitHub version). What would be the decent number of epochs to run, especially considering it takes quite a bit of…
Rajesh D
  • 311
  • 1
  • 2
  • 12
0
votes
0 answers

deepspeech training audio file length

Is it compulsory to have training and inferring audio file length equal to 5 seconds? I have this questions because I have a large amount of training data with audio(every audio more than 30 seconds) and respective transcripts. If I can’t use this…
megha
  • 115
  • 1
  • 2
  • 10
0
votes
1 answer

Increasing --train_batch_size 2 to --train_batch_size 3 causes Mozilla DeepSpeech not to train anymore. Why?

Increasing --train_batch_size 2 to --train_batch_size 3 causes Mozilla DeepSpeech not to train anymore. What could explain this? Specifically, if I run ./DeepSpeech.py --train_files data/common-voice-v1/cv-valid-train.csv --dev_files \ …
Franck Dernoncourt
  • 77,520
  • 72
  • 342
  • 501
0
votes
1 answer

What does the "Test of Epoch [number]" mean in Mozilla DeepSpeech?

What does the "Test of Epoch [number]" mean in Mozilla DeepSpeech? In the following example, it says Test of Epoch 77263, even though there should be just 1 epoch from my understanding, since I gave --display_step 1 --limit_train 1 --limit_dev 1 …
Franck Dernoncourt
  • 77,520
  • 72
  • 342
  • 501
0
votes
3 answers

Deepspeech - inferring for more audio files and saving the output

I am done with my training on common voice data for deepspeech from Mozilla and now I am able to get output for a single audio .wav file. Below is the command I am using. (deepspeech-venv)…
megha
  • 115
  • 1
  • 2
  • 10
0
votes
2 answers

Command not found error for deepspeech if installed with user flag

I installed deepspeech in ubuntu 16.04 using pip install deepspeech --user but when I use deepspeech deepspeech -h on cli it says command not found. I have tried both pip and pip3 for installation, also tried restarting but it still says command…
Prem Kumar
  • 27
  • 2
  • 8
0
votes
0 answers

Unable to install swig and swig decoders in Redhat 7 or ubuntu

I have installed the swig and swig decoder package from the below methods: Swig: yum install swig Swig Decoders: Clone https://github.com/PaddlePaddle/DeepSpeech/ Cd into decoders/swig/ and run ./setup.sh Problem: When I am importing the swig…
Rituraj kumar
  • 349
  • 1
  • 4
  • 15
-1
votes
1 answer

Python speech recognition and deepspeech

I'm trying to develop an offline voice assistant using python. I want to use speech recognition module by streaming using microphone and deepspeech. I didn't find something similar like this in the internet. How to do this?
-1
votes
2 answers

python Deepspeech TypeError: __init__() takes 2 positional arguments but 3 were given

I'm trying to implement deepspeech and as part of the implementation, I'm trying to run the below code. I have installed python3 and deepspeech-0.9.3 While running the below code, I'm getting an error as line 17, in model =…
Deepak Bandi
  • 1,854
  • 4
  • 21
  • 37
1 2 3 4 5 6
7