Questions tagged [vosk]

Questions about Vosk Speech Recognition Library

Vosk is a speech recognition toolkit. The best things in Vosk are:

Supports 9 languages - English, German, French, Spanish, Portuguese, Chinese, Russian, Turkish, Vietnamese. More to come.
Works offline, even on lightweight devices - Raspberry Pi, Android, iOS
Installs with simple pip3 install vosk
Portable per-language models are only 50Mb each, but there are much bigger server models available.
Provides streaming API for the best user experience (unlike popular speech-recognition python packages)
There are bindings for different programming languages, too - java/csharp/javascript etc.
Allows quick reconfiguration of vocabulary for best accuracy.
Supports speaker identification beside simple speech recognition.

65 questions

vote

1 answer

vosk translation output only shows last batch of translation

when I'm trying to run test_ffmpeg.py using my own audio file, it doesn't really show all the translations at the end. but it only shows the last paragraph/batch. like, my audio file goes as this: The quick brown fox jumps over the lazy dog The…

python vosk

asked Dec 07 '20 at 18:57

chip

3,039
5
35
59

vote

1 answer

install alphacep(VOSK APİ) Python

Remembrances. I https://github.com/alphacep/vosk-api I'm trying to set it up, but I can't, can anyone set it up before ?

python speech-to-text vosk

asked May 19 '20 at 16:15

StabilBey

votes

1 answer

Works on Pycharm but nonexecutable (unable to find 'vosk')

I tried to make speech recognition program using vosk model Everything worked on pycharm and I built exe file using command pyinstaller -w -F --icon=vtt_Icon.ico -n "Video Searcher" main.py However when I execute it, the message shows like the…

python pyqt5 vosk

asked Jul 27 '23 at 10:08

JRK

votes

0 answers

Vosk Bug in Speech to Text script

I am working on a python script that uses Vosk to transcripe OS and Mic audio. I want it to work with multiple threads. Operational order: start both audio streams go into a while loop that reads from the streams and safes the data seperately start…

python multithreading audio while-loop vosk

asked Jun 22 '23 at 07:32

morigan

votes

0 answers

ASP.NET Core Web API Docker VOSK library

I wrote a service that uses VOSK and I put it in a docker container. It works fine on Windows container, but when i switch it to Linux and sent the request to the endpoint I get the following error: System.DllNotFoundException: 'Unable to load…

.net linux docker vosk

asked Jun 13 '23 at 16:51

Marko

votes

0 answers

How to detector or extract number from the speech using vosk library

How to detector or extract number from the speech using vosk library i have implemented vosk library to recognise speech in Android, now i am able to recognise speech . But , I want to detect digits as numeric currently library returning word of…

android speech-recognition vosk

asked May 05 '23 at 06:16

android2892

votes

0 answers

Assertion failed when running vosk in a Node.js code on Windows

I have a code that compare the accuracy and flunecy of a text file and an audio file. I am getting an ASSERTION_FAILED error. Does anyone now how to fix this? node .\compare-basic.js LOG (VoskAPI:ReadDataFiles():model.cc:213) Decoding params beam=13…

node.js speech-to-text vosk

asked Apr 20 '23 at 23:56

John2

votes

0 answers

Could not find a version error while Installing vosk in Pydroid 3

I have tried to install vosk in Pydroid 3 https://pypi.org/project/vosk/ But i got error, here is the log: Looking in indexes: http://repo.local ERROR: Could not find a version that satisfies the requirement vosk (from versions: none) ERROR: No…

python vosk

asked Feb 24 '23 at 04:20

Willl

votes

1 answer

I cannot install vosk in npm

I want install vosk in npm but receive the following error: npm ERR! code 1 npm ERR! path C:\Users\WINDOWS 11\node_modules\ffi-napi npm ERR! command failed npm ERR! command C:\WINDOWS\system32\cmd.exe /d /s /c node-gyp-build npm ERR! gyp info it…

vosk

asked Feb 19 '23 at 08:21

GinzaTech

votes

0 answers

How can I programmatically determine the sampling rate and bitrate of a WAV file?

There is a program written in python which recognizes text with the vosk library used. To recognize text you need to pass the sampling rate and the language model with this line : KaldiRecognizer(model, 96000), where 96000 is the frequency itself.…

python wav vosk

asked Feb 14 '23 at 20:06

Alerto

votes

0 answers

Vosk settings to set script

I am using below code block for Vosk speech-to-text from their example. Is there a way to set all text so the recognition became more accurate and maybe faster? I could not find any solid document on that one but found 2 things recognizer.Aligment…

.net-core vosk

asked Feb 06 '23 at 13:50

Onur Topal

3,042
1
24
41

votes

0 answers

VOSK in Python on Mac not getting/reading microphone input

I am trying to use VOSK to transcribe voice input from my M1 MacBook Air microphone and the program runs fine, it just gets no input from the microphone. I found that if I use the larger English voice model with a '''exception_on_overflow =…

python pyaudio kaldi vosk

asked Jan 13 '23 at 05:00

Christopher

votes

0 answers

Speech to Text Model, where the model doesn't attempt to correct errors/grammer?

Is there an Vosk speech to text model, or any other open source/closed sourced model, where the model would output the spoken words into text. But it wouldn't correct them into proper words or fix their grammer, just output what they are saying in…

artificial-intelligence speech-to-text vosk

asked Dec 17 '22 at 01:54

Deus

votes

0 answers

speech recognization problem while making a call on speakerphone

I'm using vosk freeswitch plugin to do speech recognization, and playAndGetDigits function to play an audio to user while doing speech recognization. There is a problem that while user is using speakerphone to make the call and is keeping quiet, the…

speech-recognition speech-to-text freeswitch vosk

asked Nov 20 '22 at 10:43

Allan

votes

1 answer

how to get the details of ASR VOSK

I have working with Vosk and I need to get the time of each word in my file.mp3 this is my code def voice_recognition(filename): model = Model(model_name="vosk-model-fa-0.5") rec = KaldiRecognizer(model, FRAME_RATE) rec.SetWords(True) …

tensorflow nlp speech-recognition vosk

asked Nov 16 '22 at 05:54

miladjurablu

Prev 1 2

4 5 Next