I'm trying to create a TTS engine for Indian Accented English (not any Indian language).
I already have a voice recordings database for Indian accented English. So what are the next steps ?
I think we need to label them with the ".lab" extension files (well I don't really know about it!). And what are the files with ".utts" extension for ?
What is the purpose of speaker adaptive training and speaker dependent training in implementing a TTS engine using HMM ?
I googled a lot but couldn't find a detailed explanation for them. (all I could find was some Papers and Journals related to it)
It would be really helpful if you could provide we with the links to resources which guide me in creating a custom TTS using the Hidden Marvkov Models.
Thank you.