Train a VITS Speech Model using Coqui TTS | Updated Script and Audio Processing Tools

Video Link: https://www.youtube.com/watch?v=8v18u8PQXgs



Duration: 14:43


The audio processing tools for this notebook have been updated.
The VITS training loop trains or fine-tunes a model with the Coqui TTS framework, using phonemized text and speaker embeddings.
The notebook is set up for English. Other languages are possible, and are easier when the espeak-ng phonemizer supports them.
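
As a rough illustration of what "phonemized text and speaker embeddings" means in configuration terms, the sketch below builds a small config fragment using field names from Coqui TTS's VITS configuration (`use_phonemes`, `phonemizer`, `phoneme_language`, `text_cleaner`, `model_args.use_speaker_embedding`). The helper name `build_vits_config_fragment` and all values are assumptions for illustration, not the notebook's actual settings.

```python
import json


def build_vits_config_fragment(language="en-us", num_speakers=4, sample_rate=22050):
    """Return a dict shaped like part of a Coqui TTS VITS config.

    Hypothetical helper: the notebook may set these fields differently.
    """
    return {
        "model": "vits",
        # Convert text to phonemes before training instead of using raw characters.
        "use_phonemes": True,
        "phonemizer": "espeak",
        # Must be a language code that espeak-ng knows (assumed value here).
        "phoneme_language": language,
        "text_cleaner": "english_cleaners",  # assumption: English dataset
        "audio": {"sample_rate": sample_rate},
        "model_args": {
            # Speaker embeddings only matter when there is more than one speaker.
            "use_speaker_embedding": num_speakers > 1,
            "num_speakers": num_speakers,
        },
    }


if __name__ == "__main__":
    print(json.dumps(build_vits_config_fragment(), indent=2))
```

For another language, the main change would be `phoneme_language` (and likely the text cleaner), provided espeak-ng has a voice for that language.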

Please read the documentation on the Coqui GitHub page.
https://github.com/coqui-ai/TTS

VITS Training Notebook
https://colab.research.google.com/drive/1Ff8wokT-12EuYKZAXtPu_mAIf5bLXQYS?usp=sharing


VCTK Hindi model - 22 kHz audio, 4 speakers
Trained on Mozilla Common Voice and Open Speech and Language Resources datasets to 376,500 steps
***DOWNLOAD TEMPORARILY LOST***

Thorsten-Voice's video on Windows setup
https://www.youtube.com/watch?v=bJjzSo_fOS8

Demucs
https://github.com/facebookresearch/demucs

FFMpeg-Normalize
https://github.com/slhck/ffmpeg-normalize
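
A minimal sketch of how the two audio tools above might be chained when preparing clips: Demucs to isolate vocals, then ffmpeg-normalize to level the loudness and resample. It only builds and prints the command lines; nothing is executed. The file names, the 22050 Hz rate, and the helper names are assumptions, and the notebook's own processing steps may differ.

```python
import shlex


def demucs_command(src, out_dir):
    """Command to split a file into vocals and everything else (two-stem mode)."""
    return ["demucs", "--two-stems=vocals", "-o", str(out_dir), str(src)]


def normalize_command(src, dst, sample_rate=22050):
    """Command to loudness-normalize a file and resample it.

    -ar sets the output sample rate, -f overwrites an existing output file.
    """
    return ["ffmpeg-normalize", str(src), "-o", str(dst),
            "-ar", str(sample_rate), "-f"]


if __name__ == "__main__":
    # Hypothetical paths, for illustration only.
    print(shlex.join(demucs_command("raw/clip01.wav", "separated")))
    print(shlex.join(normalize_command("vocals/clip01.wav", "wavs/clip01.wav")))
```

To actually run a command, pass the list to `subprocess.run(cmd, check=True)`. Demucs writes its stems into a subfolder of the output directory named after the model it used, so check where `vocals.wav` lands before normalizing.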




Other Videos By NanoNomad


2023-05-15 Train Tortoise TTS in English, Spanish, French, Italian, Portuguese, German, and more? Maybe?
2023-05-10 DEMO: Testing French-Speaking Tortoise TTS
2023-05-10 DEMO: Testing German-Speaking Tortoise TTS
2023-05-08 DEMO: Testing Spanish Speaking Tortoise TTS
2023-05-07 DEMO: Testing Tortoise TTS Speaking in Portuguese
2023-05-04 Make Using Tortoise TTS Faster with Fine-Tuned Models
2023-05-01 AI Voice Swap and Lip Sync using Wav2Lip-HQ-Updated
2023-04-22 Voice Cloning with Tortoise TTS and Model Training Using the AI Voice Cloning WebUI
2023-04-07 Locally Hosted Chatbots with RWKV through ChatRWKV and the Text-Generation-WebUI | 14B Model on 3GB!
2023-03-29 Create Datasets for Voice Model Training on Google Colab | Updated Tools for Coqui TTS Training
2023-03-22 Train a VITS Speech Model using Coqui TTS | Updated Script and Audio Processing Tools
2023-03-15 Training or Fine Tuning a Hindi Language VITS TTS Voice Model with Coqui TTS on Google Colab
2023-03-05 Install and Configure Retroarch for PS Vita with Thumbnails, Overlays and Shaders
2023-03-03 Fallout 1 on the PS Vita is the Best Way to Play
2023-02-24 Train or Fine Tune VITS on (theoretically) Any Language | Train Multi-Speaker Model | Train YourTTS
2023-02-12 Even more Voice Cloning | Train a Multi-Speaker VITS model using Google Colab and a Custom Dataset
2023-02-04 Updated | Near-Automated Voice Cloning | Whisper STT + Coqui TTS | Fine Tune a VITS Model on Colab
2023-01-30 YourTTS Training Discussion | Experiences, Multistage Training, Demos, Prior Training Preservation
2023-01-27 Updated | Fine-Tuning YourTTS with Automated STT Datasets on Google Colab for AI Voice Cloning
2023-01-13 Fine-Tune YourTTS with Near-Automated Datasets on Google Colab for AI Voice Cloning
2022-12-22 Near-Automated Voice Cloning | Whisper STT + Coqui TTS | Fine Tune a VITS Model on Colab or Linux



Tags:
coqui tts
ai voice
voice cloning
tts
machine learning