Highest Voted 'kaldi' Questions

1

vote

1 answer

Speaker segmentation using Kaldi's x-vector approach

I'm using Kaldi for ASR, and now I want to do speaker segmentation using Kaldi's x-vector approach. Some example segmentation scripts are provided at https://github.com/kaldi-asr/kaldi/tree/master/egs/sre16/v2 . They also provide a basic…

neural-network speech-to-text kaldi

asked May 23 '18 at 08:31

humble_wolf

1,497
19
26

1

vote

1 answer

sclite (SCTK) `make check` failure, C++/perl/Cygwin, Safe to use Perl4 stuff?

I am currently trying to install NIST's sclite, which is part of SCTK 2.4.0 (github or newer version). I am attempting the install on Cygwin in bash. The installation is done using make. I have gotten past the make configure and make all parts of…

c++ perl makefile kaldi perl4

asked May 10 '18 at 23:58

bballdave025

1,347
1
15
28

1

vote

0 answers

Error in getting feature dimension on Kaldi for Voice recognition?

I have done the 'Kaldi for dummies' example for voice recognition, but I am getting the following error. Does anyone know how to fix it? [image: error message shown while running Kaldi for dummies]

voice-recognition kaldi

asked Mar 16 '18 at 04:20

Geo George

31
1
4

1

vote

2 answers

How to set up Kaldi on Mac?

I have gone through the official Kaldi documentation, and it is very hard to understand. Can anyone suggest a good tutorial that describes how to run Kaldi on a Mac?

macos kaldi

asked Feb 28 '18 at 06:42

Geo George

31
1
4

1

vote

1 answer

cudaError_t 1 : "__global__ function call is not configured" returned from 'cublasCreate(&handle_)'

I run ASR experiments using Kaldi on an SGE cluster consisting of two workstations with TITAN XP, and I randomly hit the following problem: ERROR (nnet3-train[5.2.62~4-a2342]:FinalizeActiveGpu():cu-device.cc:217) cudaError_t 1 : "__global__ function…

gpu kaldi nvidia-titan

asked Dec 21 '17 at 08:42

haibing cao

11
2

1

vote

1 answer

Kaldi LibriSpeech data preparation error

I'm trying to build an ASR system. I'm using the Kaldi manual and the LibriSpeech corpus. In the data preparation step I get this error: utils/data/get_utt2dur.sh: segments file does not exist so getting durations from wave files utils/data/get_utt2dur.sh: could not…

bash perl kaldi

asked Sep 24 '17 at 20:56

Eclipser

49
3

1

vote

1 answer

Aligning class labels to data point on a waveform plot

I am currently trying to align extracted target values to a waveform plot. The extracted target values are phoneme classes indicating whether the word being uttered is yes, no, or silence. I've been trying to properly align the target values with…

python matplotlib plot speech-recognition kaldi

asked Aug 09 '17 at 10:25

Lamda

914
3
13
39

1

vote

1 answer

Kaldi feature dimension error

I ran the 'Kaldi for dummies' tutorial and I get this error: "error getting feature dimension". Can anybody help? Thank you for your time!

kaldi

asked May 30 '17 at 12:24

Kosta

31
3

1

vote

1 answer

Doing phoneme recognition given different sized audio files?

I am currently working on phoneme recognition with a CNN. My dataset is labeled, but I am a bit unsure how to ensure that the length of the feature vector matches the length of the audio file. My input to the CNN is currently a…

python machine-learning speech-recognition pattern-recognition kaldi

asked Mar 19 '17 at 19:34

J.Down

700
1
9
32

1

vote

1 answer

Extract MFCC coefficient without the log?

I am currently trying to replicate the work of a paper in which they train a CNN using MFCC features without the DCT performed at the end. It is basically the log of the energies of the filter banks. I know that Kaldi can compute the MFCC…

audio speech-recognition speech mfcc kaldi

asked Mar 13 '17 at 10:55

J.Down

700
1
9
32

1

vote

4 answers

Kaldi toolkit Installation Error on Ubuntu 16.04

Hi, I am trying to install the Kaldi toolkit for speech recognition on Ubuntu 16.04. I have followed the instructions in the INSTALL file: (1) go to tools/ and follow INSTALL instructions there. Followed procedure in INSTALL file in tools and got following…

linux ubuntu installation ubuntu-16.04 kaldi

asked Mar 08 '17 at 21:25

Srinivas

176
2
12

1

vote

1 answer

Unable to extract delta and delta delta power spectrum computation

I am currently trying to extract the delta + delta-delta features using the add-deltas binary provided by Kaldi, but for some reason I am not able to extract them. I usually extract the power spectrum using the make_spectrum.sh script. I modified it a bit to…

speech-recognition kaldi

asked Feb 09 '17 at 01:01

Loser

67
8

0

votes

0 answers

Adapting a Kaldi model not recognizing words

I began using Kaldi to train a model for use with VOSK in Python. The issue I am experiencing is with recognizing certain words such as "manor". I have tried adding multiple variations in the lexicon file, such as M AE N ER, M AE N AO, M AE N AA, M AE…

speech kaldi

asked Aug 08 '23 at 12:00

Clinton

347
1
2
14

0

votes

0 answers

Understanding Lexicon FST in yesno example of Kaldi

I am running the yesno example in Kaldi. I could get 0 WER at the output, which means things are going right. However, when I tried to view the L.FST, I see some ambiguity. Considering the above L.FST, how do I choose the next state from the current state…

nlp speech-recognition state-machine kaldi finite-state-automaton

asked Jul 11 '23 at 06:09

Anantha Krishnan

13
3

0

votes

0 answers

How does forced alignment happen in Kaldi?

I am going through the popular 'yesno' tutorial to get comfortable with the Kaldi toolkit. For acoustic model training we will need the start and end times of each utterance, the speaker ID of each utterance, and a list of all words and phonemes present…

nlp speech-recognition state-machine kaldi finite-state-automaton

asked Jul 11 '23 at 04:20

Anantha Krishnan

13
3
