The project also allows you to synchronize the detected text with an OBS text source using obsws-python.

Warning: Python 3.11 is still not fully supported by pytorch (but it should work on the nightly build). I'd recommend using Python 3.10.6.

Before anything else: you'll need to have ffmpeg in your $PATH. You can follow this tutorial if you're on Windows. Additionally, if you're on Linux, you'll need to make sure portaudio is installed.

Run run.bat: it will handle all the following steps for you. If you set up the virtual environment correctly, there should be (venv) at the start of the command line. Install the requirements: pip install -r requirements.txt

If you would like to use the voice on something like Discord, use VB-Cable. In the script, select your normal microphone as the input and VB-Cable input as the output, then in Discord select VB-Cable output as the input.

If you're looking to use vosk/recasepunc and you need something besides the included (downloadable) models, read on. The same page that offers the vosk models also offers some recasepunc models. For additional ones, you can look in the recasepunc repo. For English I use vosk-model-en-us-0.22 and vosk-recasepunc-en-0.22. Recasepunc is technically optional when using vosk, but highly recommended to improve the output.

The script looks for models under the models/vosk and models/recasepunc folders. In a typical folder structure, recasepunc models can either be in their own folder or sit by themselves, depending on which source you download them from.