Questions tagged [mozilla-deepspeech]

Mozilla DeepSpeech is a TensorFlow implementation of Baidu's DeepSpeech architecture.

Code: https://github.com/mozilla/DeepSpeech

103 questions
0
votes
0 answers

How to install DeepSpeech (2023)

I understand the DeepSpeech project is no longer maintained and therefore cannot be installed using pip. I get the following error when trying to install it with python3 setup.py install after cloning the source code. Searching for…
afsara_ben
0
votes
0 answers

DeepSpeech cannot recognize words accurately

I am writing a real-time speech-to-text program. I am using DeepSpeech for STT and Sounddevice for microphone capture. However, I find that the words from the audio seem to be "extended" and cannot be recognized accurately. For example, when I said…
0
votes
1 answer

When running DeepSpeech training on Windows, I encountered an error with multiple processes accessing the same memory at the same time

Running in PyCharm produced the following error. What should I do? The error message is: Windows fatal exception: access violation Thread 0x000013c0 (most recent call first): File "E:\anaconda\envs\DeepSpeech\lib\threading.py", line 295…
0
votes
2 answers

Unable to install the deepspeech library from pip. Can't find the module

I am attempting to install the Deepspeech library for Python on my Ubuntu 22.04 system. I've created a virtual environment and then run pip install deepspeech. However, all I get back is the following error: ERROR: Could not find a version that…
redmage123
0
votes
0 answers

stt.node version incompatible with electron

I'm getting the error below when trying to launch the application. This is after I deleted node_modules and re-ran npm i to pull the packages again. App threw an error during load node_modules/electron/dist/resources/default_app.asar/main.js:113 Error:…
0
votes
0 answers

Audio length is 0 and so are the results for the DeepSpeech example

I am following the DeepSpeech example for nodejs_wav and I keep getting the following result: audio length 0 result: The audio files are present as well. Here is the additional console output I get when I run the code using node…
Snoopy
0
votes
0 answers

DeepSpeech Node.js MODULE_NOT_FOUND error

I cannot run the following code. Every time I try, I run into the error node:internal/modules/cjs/loader:959 throw err; Is this an issue with my using Windows, or is it that my packages aren't installed correctly because I installed all…
Deus
0
votes
1 answer

Can't import deepspeech in Kivy for Android

I am using Kivy to create an Android app. I need to install the DeepSpeech framework; however, installing it requires creating a recipe. I created a recipe and built the APK. There were no errors in the build, it…
0
votes
0 answers

Can DeepSpeech confidence be used to calculate accuracy?

In the DeepSpeech documentation, the definition of confidence is: Confidence is roughly the sum of the acoustic model logit values for each timestep/character that contributed to the creation of this transcription. But when running on different audio files,…
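Because the quoted definition describes confidence as a sum over timesteps/characters, its magnitude grows with the length of the transcript, so raw values from clips of different lengths are hard to compare directly. The sketch below shows one possible way to put them on a common scale by dividing by the number of tokens. It is only a heuristic for comparing candidates, not a measure of accuracy (which needs a reference transcript), and the function name and the numbers are hypothetical, not taken from the question.

```python
def length_normalized_confidence(confidence, num_tokens):
    """Divide a summed confidence by the number of tokens that produced it.

    `confidence` is assumed to be a sum over timesteps/characters, as in the
    definition quoted above; `num_tokens` is the transcript length.
    This is a hypothetical helper, not part of the deepspeech package.
    """
    if num_tokens <= 0:
        raise ValueError("num_tokens must be positive")
    return confidence / num_tokens


# Made-up values for illustration only.
short_clip = length_normalized_confidence(-12.0, 8)   # 8-token transcript
long_clip = length_normalized_confidence(-45.0, 60)   # 60-token transcript

# The raw sum is far lower for the long clip, but per token it scores higher.
print(short_clip, long_clip)
```

In the Python package, the confidence and the token list of each candidate should be available from the metadata returned by sttWithMetadata, so len(tokens) is one plausible choice for num_tokens.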
0
votes
1 answer

Can speech diarization be integrated with DeepSpeech?

In an online meeting such as Google Meet or Zoom, I want to detect a change of speaker and then transcribe the audio for the different speakers. I am using a DeepSpeech model for speech to text. I have fine-tuned the model for Indian-accented English but I want…
0
votes
1 answer

Loading .pbmm and .scorer file (Mozilla DeepSpeech)

I'm trying to load the .pbmm and .scorer model files available in the Mozilla DeepSpeech documentation. The load_model() function of TensorFlow doesn't support this file type and throws this error: OSError: Unable to open file (file signature not found). I…
0
votes
1 answer

Getting the following error when generating a language scorer with DeepSpeech

File "generate_scorer_package", line 1 SyntaxError: Non-UTF-8 code starting with '\xea' in file generate_scorer_package on line 2, but no encoding declared; see http://python.org/dev/peps/pep-0263/ for details
0
votes
1 answer

Using the deepspeech package for automatic speech recognition

Please, I need some help. How can I use deepspeech as an API directly in Google Colab without using the command prompt? I want to load the pre-trained model, instantiate it, and create a function that takes an audio file as input and returns the…
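A minimal sketch of what such a function could look like, assuming the deepspeech Python package (0.9.x) is installed in the notebook and the audio is a 16-bit mono WAV file. The helper names read_wav_int16, transcribe_file and load_model are hypothetical; Model, enableExternalScorer, sampleRate and stt are the calls the package is understood to expose.

```python
import wave

import numpy as np


def read_wav_int16(path):
    """Read a 16-bit mono PCM WAV file and return (samples, sample_rate)."""
    with wave.open(path, "rb") as wav:
        if wav.getnchannels() != 1 or wav.getsampwidth() != 2:
            raise ValueError("expected 16-bit mono PCM audio")
        rate = wav.getframerate()
        frames = wav.readframes(wav.getnframes())
    # The model's stt() call takes a 1-D array of 16-bit integer samples.
    return np.frombuffer(frames, dtype=np.int16), rate


def transcribe_file(model, path):
    """Return the transcript of the WAV file at `path` using a loaded model."""
    audio, rate = read_wav_int16(path)
    if rate != model.sampleRate():
        # No resampling here; the caller must supply audio at the model's rate.
        raise ValueError(
            f"audio is {rate} Hz but the model expects {model.sampleRate()} Hz"
        )
    return model.stt(audio)


def load_model(model_path, scorer_path=None):
    """Load a .pbmm model and, optionally, an external .scorer file."""
    # Imported here so the helpers above work without the package installed.
    import deepspeech

    model = deepspeech.Model(model_path)
    if scorer_path is not None:
        model.enableExternalScorer(scorer_path)
    return model
```

In a notebook cell this might be used as model = load_model("deepspeech-0.9.3-models.pbmm", "deepspeech-0.9.3-models.scorer") followed by transcribe_file(model, "audio.wav"), where the file names are placeholders for files already downloaded into the session.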
0
votes
0 answers

Building an executable with PyInstaller on Windows

I want to create a Windows executable using PyInstaller. My project has the following dependencies: deepspeech, PyAudio (which in turn depends on the PortAudio C library), and tkinter. I tried building the executable by running the command…
Shane Bishop
0
votes
1 answer

How to use GPU when transcribing using deepspeech

I'm using the excellent deepspeech package to transcribe an audio file in Python. Here's my quick implementation: import wave import deepspeech import numpy as np model_file_path = 'deepspeech-0.9.3-models.pbmm' model =…
mmz