Questions tagged [vosk]

Questions about Vosk Speech Recognition Library

Vosk is a speech recognition toolkit. The best things in Vosk are:

  • Supports 9 languages - English, German, French, Spanish, Portuguese, Chinese, Russian, Turkish, Vietnamese. More to come.
  • Works offline, even on lightweight devices - Raspberry Pi, Android, iOS
  • Installs with simple pip3 install vosk
  • Portable per-language models are only 50Mb each, but there are much bigger server models available.
  • Provides streaming API for the best user experience (unlike popular speech-recognition python packages)
  • There are bindings for different programming languages, too - java/csharp/javascript etc.
  • Allows quick reconfiguration of vocabulary for best accuracy.
  • Supports speaker identification beside simple speech recognition.
65 questions
1
vote
1 answer

vosk translation output only shows last batch of translation

when I'm trying to run test_ffmpeg.py using my own audio file, it doesn't really show all the translations at the end. but it only shows the last paragraph/batch. like, my audio file goes as this: The quick brown fox jumps over the lazy dog The…
chip
  • 3,039
  • 5
  • 35
  • 59
1
vote
1 answer

install alphacep(VOSK APİ) Python

Remembrances. I https://github.com/alphacep/vosk-api I'm trying to set it up, but I can't, can anyone set it up before ?
StabilBey
  • 41
  • 3
0
votes
1 answer

Works on Pycharm but nonexecutable (unable to find 'vosk')

I tried to make speech recognition program using vosk model Everything worked on pycharm and I built exe file using command pyinstaller -w -F --icon=vtt_Icon.ico -n "Video Searcher" main.py However when I execute it, the message shows like the…
JRK
  • 23
  • 6
0
votes
0 answers

Vosk Bug in Speech to Text script

I am working on a python script that uses Vosk to transcripe OS and Mic audio. I want it to work with multiple threads. Operational order: start both audio streams go into a while loop that reads from the streams and safes the data seperately start…
morigan
  • 23
  • 9
0
votes
0 answers

ASP.NET Core Web API Docker VOSK library

I wrote a service that uses VOSK and I put it in a docker container. It works fine on Windows container, but when i switch it to Linux and sent the request to the endpoint I get the following error: System.DllNotFoundException: 'Unable to load…
Marko
  • 11
  • 4
0
votes
0 answers

How to detector or extract number from the speech using vosk library

How to detector or extract number from the speech using vosk library i have implemented vosk library to recognise speech in Android, now i am able to recognise speech . But , I want to detect digits as numeric currently library returning word of…
android2892
  • 43
  • 1
  • 6
0
votes
0 answers

Assertion failed when running vosk in a Node.js code on Windows

I have a code that compare the accuracy and flunecy of a text file and an audio file. I am getting an ASSERTION_FAILED error. Does anyone now how to fix this? node .\compare-basic.js LOG (VoskAPI:ReadDataFiles():model.cc:213) Decoding params beam=13…
John2
  • 1
  • 2
0
votes
0 answers

Could not find a version error while Installing vosk in Pydroid 3

I have tried to install vosk in Pydroid 3 https://pypi.org/project/vosk/ But i got error, here is the log: Looking in indexes: http://repo.local ERROR: Could not find a version that satisfies the requirement vosk (from versions: none) ERROR: No…
Willl
  • 3
  • 2
0
votes
1 answer

I cannot install vosk in npm

I want install vosk in npm but receive the following error: npm ERR! code 1 npm ERR! path C:\Users\WINDOWS 11\node_modules\ffi-napi npm ERR! command failed npm ERR! command C:\WINDOWS\system32\cmd.exe /d /s /c node-gyp-build npm ERR! gyp info it…
0
votes
0 answers

How can I programmatically determine the sampling rate and bitrate of a WAV file?

There is a program written in python which recognizes text with the vosk library used. To recognize text you need to pass the sampling rate and the language model with this line : KaldiRecognizer(model, 96000), where 96000 is the frequency itself.…
Alerto
  • 11
  • 2
0
votes
0 answers

Vosk settings to set script

I am using below code block for Vosk speech-to-text from their example. Is there a way to set all text so the recognition became more accurate and maybe faster? I could not find any solid document on that one but found 2 things recognizer.Aligment…
Onur Topal
  • 3,042
  • 1
  • 24
  • 41
0
votes
0 answers

VOSK in Python on Mac not getting/reading microphone input

I am trying to use VOSK to transcribe voice input from my M1 MacBook Air microphone and the program runs fine, it just gets no input from the microphone. I found that if I use the larger English voice model with a '''exception_on_overflow =…
0
votes
0 answers

Speech to Text Model, where the model doesn't attempt to correct errors/grammer?

Is there an Vosk speech to text model, or any other open source/closed sourced model, where the model would output the spoken words into text. But it wouldn't correct them into proper words or fix their grammer, just output what they are saying in…
Deus
  • 1
  • 1
0
votes
0 answers

speech recognization problem while making a call on speakerphone

I'm using vosk freeswitch plugin to do speech recognization, and playAndGetDigits function to play an audio to user while doing speech recognization. There is a problem that while user is using speakerphone to make the call and is keeping quiet, the…
Allan
  • 191
  • 1
  • 10
0
votes
1 answer

how to get the details of ASR VOSK

I have working with Vosk and I need to get the time of each word in my file.mp3 this is my code def voice_recognition(filename): model = Model(model_name="vosk-model-fa-0.5") rec = KaldiRecognizer(model, FRAME_RATE) rec.SetWords(True) …
miladjurablu
  • 31
  • 1
  • 5