Train a VITS Speech Model using Coqui TTS | Updated Script and Audio Processing Tools
Updated the audio processing tools for this notebook.
The VITS training loop trains or fine-tunes a model with the Coqui framework, using phonemized text and speaker embeddings.
This is set up for English. It can be done in other languages, and it is easier for languages that are supported by the espeak-ng phonemizer.
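Before training in another language, it may help to check whether espeak-ng lists a voice for it. The sketch below is not part of the notebook. The function names are made up for illustration, and it assumes the usual `espeak-ng --voices` table layout (one header row, language code in the second column).

```python
import shutil
import subprocess


def parse_voice_languages(voices_output):
    """Collect the language codes listed in `espeak-ng --voices` output.

    Assumes the usual layout: one header row, then rows whose second
    whitespace-separated column is the language code.
    """
    languages = set()
    for line in voices_output.splitlines()[1:]:  # skip the header row
        fields = line.split()
        if len(fields) >= 2:
            languages.add(fields[1].lower())
    return languages


def espeak_supports(language, voices_output):
    """True if `language` matches a listed code or its base (en matches en-us)."""
    code = language.lower()
    listed = parse_voice_languages(voices_output)
    return code in listed or any(item.split("-")[0] == code for item in listed)


def list_espeak_voices():
    """Run `espeak-ng --voices` and return its text output."""
    exe = shutil.which("espeak-ng")
    if exe is None:
        raise RuntimeError("espeak-ng was not found on PATH")
    result = subprocess.run(
        [exe, "--voices"], capture_output=True, text=True, check=True
    )
    return result.stdout
```

With espeak-ng installed, `espeak_supports("hi", list_espeak_voices())` would report whether a Hindi voice is listed.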
Please read the documentation on the Coqui GitHub page.
https://github.com/coqui-ai/TTS
VITS Training Notebook
https://colab.research.google.com/drive/1Ff8wokT-12EuYKZAXtPu_mAIf5bLXQYS?usp=sharing
VCTK Hindi model - 22 kHz audio, 4 speakers
Trained on Mozilla Common Voice and Open Speech and Language Resources datasets to 376,500 steps
***DOWNLOAD TEMPORARILY LOST***
Thorsten-Voice's video on Windows setup
https://www.youtube.com/watch?v=bJjzSo_fOS8
Demucs
https://github.com/facebookresearch/demucs
FFMpeg-Normalize
https://github.com/slhck/ffmpeg-normalize
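As a rough sketch of how these two tools could be chained when preparing clips, the helpers below build the command lines for vocal separation with Demucs and loudness normalization with ffmpeg-normalize. This is an assumption about a typical workflow, not the notebook's actual code. The helper names and file paths are hypothetical, and the 22050 Hz default is chosen only to match the 22 kHz audio mentioned above.

```python
def demucs_command(audio_path, out_dir):
    """Command that asks Demucs to split a file into vocals and everything else."""
    return ["demucs", "--two-stems=vocals", "-o", str(out_dir), str(audio_path)]


def normalize_command(audio_path, out_path, sample_rate=22050):
    """Command that loudness-normalizes a file and resamples it.

    -o sets the output file, -ar the output sample rate,
    -f overwrites an existing output file.
    """
    return [
        "ffmpeg-normalize", str(audio_path),
        "-o", str(out_path),
        "-ar", str(sample_rate),
        "-f",
    ]


# Example (hypothetical paths); requires both tools to be installed:
# import subprocess
# subprocess.run(demucs_command("raw/clip01.wav", "separated"), check=True)
# subprocess.run(normalize_command("vocals/clip01.wav", "wavs/clip01.wav"), check=True)
```

Building the commands as lists keeps file names with spaces intact when they are passed to `subprocess.run`.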
Other Videos By NanoNomad
2023-05-15 Train Tortoise TTS in English, Spanish, French, Italian, Portuguese, German, and more? Maybe?
2023-05-10 DEMO: Testing French-Speaking Tortoise TTS
2023-05-10 DEMO: Testing German-Speaking Tortoise TTS
2023-05-08 DEMO: Testing Spanish Speaking Tortoise TTS
2023-05-07 DEMO: Testing Tortoise TTS Speaking in Portuguese
2023-05-04 Make Using Tortoise TTS Faster with Fine-Tuned Models
2023-05-01 AI Voice Swap and Lip Sync using Wav2Lip-HQ-Updated
2023-04-22 Voice Cloning with Tortoise TTS and Model Training Using the AI Voice Cloning WebUI
2023-04-07 Locally Hosted Chatbots with RWKV through ChatRWKV and the Text-Generation-WebUI | 14B Model on 3GB!
2023-03-29 Create Datasets for Voice Model Training on Google Colab | Updated Tools for Coqui TTS Training
2023-03-22 Train a VITS Speech Model using Coqui TTS | Updated Script and Audio Processing Tools
2023-03-15 Training or Fine Tuning a Hindi Language VITS TTS Voice Model with Coqui TTS on Google Colab
2023-03-05 Install and Configure Retroarch for PS Vita with Thumbnails, Overlays and Shaders
2023-03-03 Fallout 1 on the PS Vita is the Best Way to Play
2023-02-24 Train or Fine Tune VITS on (theoretically) Any Language | Train Multi-Speaker Model | Train YourTTS
2023-02-12 Even more Voice Cloning | Train a Multi-Speaker VITS model using Google Colab and a Custom Dataset
2023-02-04 Updated | Near-Automated Voice Cloning | Whisper STT + Coqui TTS | Fine Tune a VITS Model on Colab
2023-01-30 YourTTS Training Discussion | Experiences, Multistage Training, Demos, Prior Training Preservation
2023-01-27 Updated | Fine-Tuning YourTTS with Automated STT Datasets on Google Colab for AI Voice Cloning
2023-01-13 Fine-Tune YourTTS with Near-Automated Datasets on Google Colab for AI Voice Cloning
2022-12-22 Near-Automated Voice Cloning | Whisper STT + Coqui TTS | Fine Tune a VITS Model on Colab or Linux
Tags: coqui tts
ai voice
voice cloning
tts
machine learning