Questions tagged [vosk]

Questions about Vosk Speech Recognition Library

Vosk is a speech recognition toolkit. The best things in Vosk are:

  • Supports 9 languages - English, German, French, Spanish, Portuguese, Chinese, Russian, Turkish, Vietnamese. More to come.
  • Works offline, even on lightweight devices - Raspberry Pi, Android, iOS
  • Installs with simple pip3 install vosk
  • Portable per-language models are only 50Mb each, but there are much bigger server models available.
  • Provides streaming API for the best user experience (unlike popular speech-recognition python packages)
  • There are bindings for different programming languages, too - java/csharp/javascript etc.
  • Allows quick reconfiguration of vocabulary for best accuracy.
  • Supports speaker identification beside simple speech recognition.
65 questions
0
votes
0 answers

Extract odd formed nested json to a python dataframe / extract from VOSK in standard JSON format

I am working with VOSK to recognise some speech. Until now I have just needed the final output and could get this from the below: def text_from_audio_orig( path, file, lang, location): …
James Oliver
  • 547
  • 1
  • 4
  • 17
0
votes
1 answer

I am using react-native-vosk package for speech recognition but how can I run it in background to listen continuously

I am using this pacakge'react-native-vosk' but the issue is I can't run this listener in background
0
votes
1 answer

Streaming and recording in the background using vosk in python

I want to record voices using vosk in the background, while in the main thread I still read my ultrasonic sensors values without stopping the script, but when I try to do so, threads don't run asynchronously
0
votes
0 answers

Vosk offline speech recognition accuracy issue on android

Here is the full config i'm using for android, Version: 0.3.32 Model : vosk-model-small-en-us-zamia-0.5 Gradle: 7.3.3-bin.zip Android studio: Chipmunk (2021.2.1 Patch 1) Scenario: I've build an offline voice recognition service which generally wake…
Rezwan
  • 61
  • 2
0
votes
0 answers

Android VOSK library with noise cancellation not working

I am using the VOSK offline Speech-to-text library with Agora video call for speech-to-text functionality. When I am using the VOSK microphone access the speaker sound as a speech and translate it. so is there any way to omit the speaker sound?
Jaydip Godhani
  • 185
  • 2
  • 14
0
votes
1 answer

Trying to open a py file inside Pycharm but it opens outside of Pycharm for some reason

elif "go online" in result["text"]: os.system("clear") stream.stop_stream() print("Sir: " + result["text"]) speak("Switching to online mode") os.system("open /Users/************/PycharmProjects/K.I.T.T/K" …
TAC
  • 21
  • 3
0
votes
0 answers

Vosk in Windows 10

I need help in using Vosk in Windows 10. I installed python 3.8, then used the pip install vosk command. The Python installation is at C:\Users\kamal\AppData\Local\Programs\Python\Python38 and I can see that vosk is at…
ksd
  • 1
0
votes
2 answers

Python doesn't find model in folder at Model(model_name="models/model-en-0.2) If Model(model_name="model-en-0.2)is writen then it works.How to fix it?

Python doesn't find the vosk model in the folder at Model(model_name="libs/models/vosk-model-small-en-0.22"). from vosk import Model, KaldiRecognizer FRAME_RATE = 16000 CHANNELS=1 model = Model(model_name="libs/models/vosk-model-small-en-0.22")…
Alexei
  • 277
  • 1
  • 2
  • 9
0
votes
1 answer

how to get customized output in text file

I have prepared a Speech to Text system using using Kaldi and vosk. Ive setup the directories and I am using vosk to give transcription of audio files. The pipeline is that I run bash script which basically takes in audio file name (without…
SAGE
  • 9
  • 3
0
votes
1 answer

Vosk gives error ModuleNotFoundError: No module named '_cffi_backend'

So basically, I have this speech recognition software in Python which worked flawlessly then my drive died and I had to buy a new one, reinstalled Windows, reinstalled Python (same version), and pip install vosk, then tried to run it. Now it all the…
0
votes
1 answer

Shared Object error while importing package

I'm actually trying to import a package in the python shell but I get a shared object error >>> from vosk import aligner Traceback (most recent call last): File "", line 1, in File…
Vikram Vm
  • 13
  • 3
0
votes
1 answer

Improving accuracy of speech recognition using Vosk (Kaldi) running on Android

I am developing an application to collect data in the field on Android devices using speech recognition. There are five "target words", as well as several numbers (zero, one, ten, one-hundred, etc) that are recognized. I have improved accuracy of…
portsample
  • 1,986
  • 4
  • 19
  • 35
0
votes
0 answers

How to use Vosk models from WebSocket online server?

I have been developing an android app that uses the speech recognition service but the android device has no Google app installed. For that reason, I'm using the vosk API for speech recognition but for better accuracy in speech recognition. I need…
0
votes
0 answers

How to multithread a program and return results to main.py?

I am using Windows and python. The objective is simple, running the main.py starts a speech recognition and once it recognizes what was said returns the text to main.py. The speech recognition program recognizes without any issues, the problem is at…
Benji
  • 15
  • 5
0
votes
1 answer

Compile file with vosk api with PyInstaller

I try to compile exe file from the python file with vosk library. Compilation is ok. Everytime when I try to run exe obtain this error: Traceback (most recent call last): File "Vega.py", line 10, in from vosk import Model,…
Alex97
  • 73
  • 2
  • 9