The solution is to use Ubuntu on Windows through the Windows Subsystem for Linux (WSL):
- Get WSL for Windows
- From your Ubuntu bash shell, navigate to the folder where you want to do the setup. The Windows file system is accessible under /mnt/c/ (your C: drive), which sits below the root directory.
- From there, simply follow the official instructions: after installing the necessary packages in your Ubuntu system, clone the git repo and build the project by running
cmake .. && make -j2
from a build subdirectory of the clone.
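The steps above can be sketched as one script. This is only a sketch under stated assumptions: that the project being built is KenLM, that its repo lives at https://github.com/kpu/kenlm, and that the apt package list below covers its build dependencies. The WORKDIR path and the step helper are hypothetical names used for illustration. By default the script only prints the commands, so you can check them against the official instructions before running anything.

```shell
#!/usr/bin/env bash
# Sketch of the WSL build steps. Assumptions (not from the original answer):
# the project is KenLM, cloned from https://github.com/kpu/kenlm, and the
# apt packages below are its build dependencies.
# Commands are only printed unless RUN=1 is set.
set -eu

# Hypothetical working folder on the Windows C: drive, as seen from WSL.
WORKDIR="${WORKDIR:-/mnt/c/projects}"

# Print a command, and execute it only when RUN=1.
step() {
    echo "+ $*"
    if [ "${RUN:-0}" = "1" ]; then
        "$@"
    fi
}

build_kenlm() {
    step sudo apt-get install -y build-essential cmake libboost-all-dev zlib1g-dev libbz2-dev liblzma-dev
    step mkdir -p "$WORKDIR"
    step cd "$WORKDIR"
    step git clone https://github.com/kpu/kenlm.git
    step mkdir -p kenlm/build
    step cd kenlm/build
    step cmake ..
    step make -j2
}

build_kenlm
```

Saved as, say, build_kenlm.sh (a hypothetical file name), running `bash build_kenlm.sh` lists the eight commands, and `RUN=1 bash build_kenlm.sh` actually executes them inside your Ubuntu system.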
You must train the models or build the scorers from the Ubuntu bash shell. The resulting models can also be used from Windows through the kenlm Python library.
For example, the two steps to build a scorer for the DeepSpeech model, as described here, should be executed from your Ubuntu system. Once you have the scorer, you should be able to run the command
deepspeech --model deepspeech-0.9.3-models.pbmm --scorer kenlm.scorer --audio audio.wav
from Windows. However, once you have WSL there is no need to do this work from Windows; everything works nicely in your Ubuntu system.