Updated | Fine-Tuning YourTTS with Automated STT Datasets on Google Colab for AI Voice Cloning
A follow-up to the YourTTS video. Here you can fine-tune a multi-speaker YourTTS model on your own voice samples. The samples are split, converted, denoised with rnnoise, transcribed with OpenAI's Whisper STT, assembled into a VCTK-format dataset, and then used to fine-tune the YourTTS model with Coqui TTS.
The script is currently configured for English voices only; other languages require separate datasets, and several settings are hardcoded for English for ease of use.
It probably mostly works if you have a good dataset. Probably. No promises.
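The dataset-assembly steps above can be sketched in Python. The layout here (`wav48/<speaker>/` and `txt/<speaker>/` directories with three-digit clip indices) follows the public VCTK corpus convention and is an assumption about what the Colab script produces, and the helper names are hypothetical:

```python
from pathlib import Path

def vctk_paths(root: str, speaker: str, clip_idx: int):
    """Map a clip to its wav/transcript locations in a VCTK-style tree:
    <root>/wav48/<speaker>/<speaker>_<idx>.wav and
    <root>/txt/<speaker>/<speaker>_<idx>.txt (layout assumed from VCTK)."""
    base = f"{speaker}_{clip_idx:03d}"
    wav = Path(root) / "wav48" / speaker / f"{base}.wav"
    txt = Path(root) / "txt" / speaker / f"{base}.txt"
    return wav, txt

def add_clip(root: str, speaker: str, clip_idx: int, transcript: str) -> Path:
    """Write one Whisper transcript into the tree and return the path where
    the caller should place the matching denoised (rnnoise) wav."""
    wav, txt = vctk_paths(root, speaker, clip_idx)
    wav.parent.mkdir(parents=True, exist_ok=True)
    txt.parent.mkdir(parents=True, exist_ok=True)
    txt.write_text(transcript.strip() + "\n", encoding="utf-8")
    return wav
```

Splitting, conversion, denoising, and transcription themselves are handled inside the Colab script; this sketch only shows how the resulting clips would slot into a VCTK-format dataset.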
Updated Whisper STT+Coqui YourTTS Colab Script:
https://colab.research.google.com/drive/1GsOL7pwCrECagRxmOgJoKOHtlaGhUCma?usp=sharing
WaveShop:
https://waveshop.sourceforge.net/download.html
Sonic Visualiser:
https://www.sonicvisualiser.org/
Coqui's Dataset Guide:
https://github.com/coqui-ai/TTS/wiki/What-makes-a-good-TTS-dataset
rnnoise:
https://github.com/xiph/rnnoise
Generate text with the CLI:
tts --text "text" --out_path outfile.wav --model_path "multivoice/traineroutput/run path/best_model.pth" --config_path "multivoice/traineroutput/run path/config.json" --speakers_file_path multivoice/speakers.pth --speaker_idx VCTK_speaker
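Since the run directory in the example contains a space, calling the CLI from Python with an argument list avoids shell-quoting mistakes entirely. This is just a wrapper around the same `tts` command; the directory and file names mirror the example above and are placeholders for your own training output:

```python
import subprocess

def build_tts_cmd(text: str, out_path: str, run_dir: str,
                  speakers_file: str, speaker: str) -> list[str]:
    """Assemble the Coqui `tts` invocation as an argument list, so paths
    with spaces (like 'run path') need no quoting."""
    return [
        "tts",
        "--text", text,
        "--out_path", out_path,
        "--model_path", f"{run_dir}/best_model.pth",
        "--config_path", f"{run_dir}/config.json",
        "--speakers_file_path", speakers_file,
        "--speaker_idx", speaker,
    ]

cmd = build_tts_cmd("text", "outfile.wav",
                    "multivoice/traineroutput/run path",
                    "multivoice/speakers.pth", "VCTK_speaker")
# subprocess.run(cmd, check=True)  # uncomment once Coqui TTS is installed
```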