I am trying to implement speech to text on my .net core 3.1 web app without using any cloud services like Azure, AWS, or Google.
I have found options like deepspeech, kaldi,pocketpheonix. It looks like they can be implemented on a .net core app but being a beginner in .net core and backend development I don't know how to use them and their documentation is not very beginner-friendly.
So if you can provide a brief overview of how to implement these apis on .net core or link to some articles, that would be very helpful.