Train or Fine Tune VITS on (theoretically) Any Language | Train Multi-Speaker Model | Train YourTTS

Channel: NanoNomad
Subscribers: 2,340
Published on: 2023-02-24
Video Link: https://www.youtube.com/watch?v=MU5157dKOHM
Duration: 15:48
3,335 views
75


VITS Multispeaker English Training and Fine Tuning Notebook:
https://colab.research.google.com/drive/178Nv5lmMdI1pMHmE0X0EsAxZEDZoklqQ?usp=sharing

VITS Alternate Language Training and Fine Tuning Notebook:
https://colab.research.google.com/drive/1zQXTel8AyqNvnnBLMItbzs-kUv51Dwat?usp=sharing

YourTTS Training and Fine Tuning notebook:
https://colab.research.google.com/drive/1MqiLjNaVNIEmD31A0s48U0GgyEHRk7Vo?usp=sharing

The YourTTS and VITS multi-speaker English-language notebooks have been updated. The new notebook is for training a VITS model on languages other than English.

In this one I take a look at training a VITS model on an alternate language using Coqui TTS on Google Colab. I trained a Spanish-speaking model on mostly-blind sample data. I don't speak Spanish, so I can't properly evaluate the result, but it started sounding pretty good for what it was.
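When training on a language other than English with characters rather than phonemes, it helps to know which characters the transcripts actually contain. The sketch below is not taken from the notebooks. It assumes an LJSpeech-style metadata file with `id|text` rows, and the helper name `collect_characters` is hypothetical. The resulting strings could then be copied by hand into the character settings of a Coqui TTS config.

```python
import csv
import unicodedata


def collect_characters(metadata_lines, text_column=1, delimiter="|"):
    """Return (letters, punctuation) seen in the transcripts as sorted strings.

    Assumption: each line looks like "clip_id|transcript" (LJSpeech-style).
    """
    letters, punctuation = set(), set()
    reader = csv.reader(metadata_lines, delimiter=delimiter, quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) <= text_column:
            continue  # skip rows that have no transcript column
        # Normalize so accented letters are single code points (e.g. "ñ").
        text = unicodedata.normalize("NFC", row[text_column])
        for ch in text:
            if ch.isspace():
                continue
            category = unicodedata.category(ch)
            if category.startswith(("L", "N")):    # letters and digits
                letters.add(ch)
            elif category.startswith(("P", "S")):  # punctuation and symbols
                punctuation.add(ch)
    return "".join(sorted(letters)), "".join(sorted(punctuation))


# Two made-up Spanish rows standing in for a real metadata file.
sample = [
    "clip_0001|¿Dónde está la biblioteca?",
    "clip_0002|El niño comió piña.",
]
letters, punctuation = collect_characters(sample)
print("letters:", letters)
print("punctuation:", punctuation)
```

For a real dataset, pass an open file object instead of the `sample` list, for example `collect_characters(open("metadata.csv", encoding="utf-8"))`. The file name here is only an illustration.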

Then I review some of the changes and differences in the multispeaker VITS notebook and the YourTTS notebook.

Other videos:
Multispeaker VITS https://www.youtube.com/watch?v=45DiA-aJwXI
YourTTS training https://www.youtube.com/watch?v=1yt2W-uK8mk

Check out Unscripted Coding if you want to watch someone explore cool open source projects: https://www.youtube.com/@UnscriptedCoding

Download my multilingual, multispeaker YourTTS model on Huggingface: https://huggingface.co/AOLCDROM/YourTTS-Fr-En-De-Es
See allvoices.txt for information about each speaker:language training pair. The model was trained on character sets and uses 'artificial' language codes.

RTFM:
https://tts.readthedocs.io/en/latest/
https://github.com/openai/whisper
https://tts.readthedocs.io/en/latest/models/vits.html
https://arxiv.org/pdf/2106.06103.pdf




Other Videos By NanoNomad


2023-05-07 DEMO: Testing Tortoise TTS Speaking in Portuguese
2023-05-04 Make Using Tortoise TTS Faster with Fine-Tuned Models
2023-05-01 AI Voice Swap and Lip Sync using Wav2Lip-HQ-Updated
2023-04-22 Voice Cloning with Tortoise TTS and Model Training Using the AI Voice Cloning WebUI
2023-04-07 Locally Hosted Chatbots with RWKV through ChatRWKV and the Text-Generation-WebUI | 14B Model on 3GB!
2023-03-29 Create Datasets for Voice Model Training on Google Colab | Updated Tools for Coqui TTS Training
2023-03-22 Train a VITS Speech Model using Coqui TTS | Updated Script and Audio Processing Tools
2023-03-15 Training or Fine Tuning a Hindi Language VITS TTS Voice Model with Coqui TTS on Google Colab
2023-03-05 Install and Configure Retroarch for PS Vita with Thumbnails, Overlays and Shaders
2023-03-03 Fallout 1 on the PS Vita is the Best Way to Play
2023-02-24 Train or Fine Tune VITS on (theoretically) Any Language | Train Multi-Speaker Model | Train YourTTS
2023-02-12 Even more Voice Cloning | Train a Multi-Speaker VITS model using Google Colab and a Custom Dataset
2023-02-04 Updated | Near-Automated Voice Cloning | Whisper STT + Coqui TTS | Fine Tune a VITS Model on Colab
2023-01-30 YourTTS Training Discussion | Experiences, Multistage Training, Demos, Prior Training Preservation
2023-01-27 Updated | Fine-Tuning YourTTS with Automated STT Datasets on Google Colab for AI Voice Cloning
2023-01-13 Fine-Tune YourTTS with Near-Automated Datasets on Google Colab for AI Voice Cloning
2022-12-22 Near-Automated Voice Cloning | Whisper STT + Coqui TTS | Fine Tune a VITS Model on Colab or Linux
2022-12-09 Dreambooth and Fine Tuning for Stable Diffusion 1.5 and 2 with this Versatile Script
2022-11-30 If Bill Gates could rap? AI Synthesized Voice, AI Upsampled Video | Deltron 3030's Virus
2022-11-14 Training Stable Diffusion Dreambooth on Multiple Subjects for Combined Image Generation
2022-10-31 Locally Train Stable Diffusion with Dreambooth using WSL Ubuntu



Tags:
voice cloning
AI voice
machine learning
tts
YourTTS
VITS