Questions tagged [vosk]

Questions about Vosk Speech Recognition Library

Vosk is a speech recognition toolkit. The best things in Vosk are:

Supports 9 languages - English, German, French, Spanish, Portuguese, Chinese, Russian, Turkish, Vietnamese. More to come.
Works offline, even on lightweight devices - Raspberry Pi, Android, iOS
Installs with simple pip3 install vosk
Portable per-language models are only 50Mb each, but there are much bigger server models available.
Provides streaming API for the best user experience (unlike popular speech-recognition python packages)
There are bindings for different programming languages, too - java/csharp/javascript etc.
Allows quick reconfiguration of vocabulary for best accuracy.
Supports speaker identification beside simple speech recognition.

65 questions

vote

1 answer

How to manipulate audio without saving it in Flask with pydub

I'm creating my first Flask API. I send audio file from React Native to Flask. Then I want to process it with vosk module (convert audio to text). Is it possible to manage it without saving the file on server? This is the code that works but saves…

asked Dec 25 '22 at 07:41

Xenia

vote

1 answer

Why is my flutter app crashing in release mode and working fine in debug mode?

I am currently developing a voice assisted flutter app using vosk and the vosk-flutter-plugin linked below. https://github.com/alphacep/vosk-flutter The app works completely fine in debug mode but crashes upon opening the view that accesses the…

flutter debugging build release vosk

asked Dec 16 '22 at 16:16

yanekwd

vote

0 answers

realtime speech to text with vosk and streamlit

i am trying to build a realtime offline speech recognition system using vosk and streamlit. i completed the vosk normal statement recognition but it doesnt working in live . when i am using threading in python it gives main thread error and also…

vosk

asked Nov 12 '22 at 10:58

Voleti Nagendra kumar

vote

0 answers

How to add words to the dictionary of the English vosk model?

I am using the english vosk model (vosk-model-en-us-0.22, large) in Python to recognize speech from files. I need to add specific words to it dictionary that are not originally there. In the file at: user\…

python speech-recognition speech-to-text speech vosk

asked Nov 04 '22 at 23:22

user522913

vote

0 answers

from vosk import Model, KaldiRecognizer ImportError: No module named vosk

I am working with vosk software using orange pi zero 256mb and python. I try so many idea and always ı see this error. I installed old version of Vosk and download old-new (.whl) file make a virtual environment (env) but doesnt work anyone.…

raspberry-pi putty orange-pi vosk

asked Oct 21 '22 at 20:15

Mesut

vote

0 answers

how to submit an rtsp stream to pyaudio for further processing

I have a function that connects to my microphone and recognizes speech as a string with a phrase. It looks like this: import json, pyaudio from Analysis import Voice_tag, nlp from vosk import Model, KaldiRecognizer class VoiseRecorder(): …

python pyaudio vosk

asked Oct 07 '22 at 07:47

Serega

vote

0 answers

how can i fix the error with the "ivector" folder from vosk model?

I wanted to run my Vosk voice recognition project but got this error. How can I fix this? ERROR (VoskAPI:DecodableNnetLoopedOnlineBase():decodable-online-looped.cc:50) Ivector feature dimension mismatch: got -1 but network expects 40 In folder…

python python-3.x directory model vosk

asked Jan 12 '22 at 07:45

Кирилл Мазиев

vote

1 answer

Is it possible to get the timing of phonemes with Vosk, instead of full words?

Vosk outputs the timing of words well, but for syllables / phonemes within words, does Vosk have an option or technique to split the output by those instead?

vosk

asked Sep 15 '21 at 02:54

tscizzle

11,191
15
54
88

vote

0 answers

Pygame key-presses don't work on 2nd run of cell in Jupyter Lab

When I run the code below after a kernel restart in Jupyter Lab if I hit the space bar, a highlighted 'aaa' will result (as it should). But, if I ever have to re-run the same cell again (without a kernel restart), a space bar hit will just result in…

pygame jupyter-lab vosk

asked Aug 01 '21 at 13:59

multilingualtricker

vote

1 answer

Cannot run .c because of segmentation fault using vosk

I'm on ubuntu 18.04 and i'm trying to run a .c file that came with an API called vosk that i just want to run . The issue is that the python script (which comes standard with the API) works without any problems but after compiling with make the .c…

c linux kaldi vosk

asked Jul 16 '21 at 08:38

Birto

vote

1 answer

What does "ERROR: fstsymbols: Saving osymbols but there are no output symbols" means and how can i solve it?

I wanted to update the language model of Kaldi model used in Vosk. I was following the Updating the language model in Vosk Adaptation I had this ERROR: fstsymbols: Saving osymbols but there are no output symbols when I try to run fstsymbols…

openfst vosk

asked Jun 22 '21 at 01:51

323 Kobby

vote

1 answer

Creating a project specific Vosk dictionary

I am working on an application which uses Vosk for speech recognition. I would like to create a dictionary for the application which contains only the trigger words and spoken numbers needed by the application. Using command line instructions found…

speech-to-text kaldi vosk

asked Jun 10 '21 at 20:44

portsample

1,986
4
19
35

vote

2 answers

Electron cannot open shared object file from node_modules folder

On electron, the node module vosk needs to access some shared objects located in node_modules/vosk/lib/. The issue I am having right now is that, when I do require('vosk') in my main.js and try to execute my AppImage file, I get: A JavaScript error…

node.js electron vosk

asked Mar 25 '21 at 01:19

Sharcoux

5,546
7
45
78

vote

0 answers

How to send data from RecordRTC to Vosk for Speech-to-text

I am using a Vosk server for speech-to-text conversion. I send the audio/wav blob data obtained using this method recorder.stopRecording(function() { var blob = this.blob; // below one is recommended var blob = this.getBlob(); }); from…

node.js websocket webrtc wav vosk

asked Jan 25 '21 at 02:15

Ikun

vote

1 answer

Need to reload vosk model for every transcription?

The vosk model that I'm using is the vosk-model-en-us-aspire-0.2 (1.4GB). Every time needs quite an amount of time to load the vosk model. Is it necessary to recreate the vosk object every time? It takes much time to load the model if we only load…

python transcription kaldi vosk

asked Jan 16 '21 at 13:01

Ming Zhong Tan

Prev 1

3 4 5 Next