Improve pronounciation of a model

Asked Apr 22 '21 at 15:08

Active Apr 22 '21 at 20:13

Viewed 25 times

I fine-tuned a dataset of Nvidia Tacotron2. While working reasonably well, there are some mispronounciations of words(I train a german dataset).

I have another set of wave files by the same speaker with according metadata.csv

How do I filter this to include mainly those sentences that teach the model the very pronounciations that are missing?

edited Apr 22 '21 at 20:13

GoodDeeds

asked Apr 22 '21 at 15:08

Nathan Pingel

0 Answers0