Voice Cloning Tutorial with Coqui TTS and Google Colab | Fine Tune Your Own VITS Model for Free

Channel:
Subscribers:
2,340
Published on ● Video Link: https://www.youtube.com/watch?v=6QAGk_rHipE



Category:
Tutorial
Duration: 17:12
27,995 views
713


12/22/22 - Follow-up video with different notebook and Linux install instructions, https://www.youtube.com/watch?v=e_DCb1XPWS0 - Automated Dataset Creation with Whisper STT + VITS Training with Coqui TTS

***!!!***IMPORTANT***!!!*** 11/29/22: New notebook added, no need to manually upload training scripts anymore. Just set the variables and run the cells. The scripts will be output to files on your Google Drive:
https://colab.research.google.com/drive/1N_B_38MMRk1BUqwI_C829TyGpNmppUqK?usp=sharing

A quick and dirty voice cloning tutorial. How to fine tune a VITS voice model using the Coqui TTS framework on Google Colab.

Follow along and see how to make a voice model like the Bill Gates one used in this: https://www.youtube.com/watch?v=1ztm7aBssgA (Bill Gates raps to Deltron 3030's Virus)

https://pastebin.com/HsdBjGib - save this as rnnoise.py and upload it to Google Drive

NEW INFO 9/13/22. UPDATED COQUI 0.8.0 MODEL. FASTER TRAINING. BETTER QUALITY.
https://pastebin.com/UUm7cS78 save as finetune_vits.ipynb and open/upload to Google Colab
https://pastebin.com/UjzSkSsx save as train_vits.py to google drive

OLD INFO:
https://pastebin.com/6Uf8syzJ - ***NOTE: 9/3/22 Colab script updated to fix restore fine tuning. save this as Voice_Clone.ipnyb and upload it to Google Colab or your Google Drive. Then, in Colab, select open from Drive.
https://pastebin.com/6TBGzbQY - save this as train-vits-bg-colab.py in your google drive folder

​@ThorstenMueller For great Coqui TTS and Mycroft videos
https://github.com/coqui-ai/TTS - Coqui TTS site
https://www.audacityteam.org/ - Audacity editor
http://waveshop.sourceforge.net/ - WaveShop editor
https://www.sonicvisualiser.org/ - Sonic Visualiser
https://www.gyan.dev/ffmpeg/builds/ - FFmpeg Windows Builds
https://notepad-plus-plus.org/ - Notepad++




Other Videos By NanoNomad


2022-10-07Training Textual Inversion for Stable Diffusion | Customizable AI Image Generation
2022-09-26How to Download All Styles and Objects from the Stable Diffusion Concepts Library | AI Images
2022-09-05AI Images | Installing Stable Diffusion and the Automatic1111 WebUI using Conda on Windows 10
2022-09-04AI Image Generation with Stable Diffusion Part 2 | Img2Img Transformations, Masking, Upscaling
2022-09-01AI Image Generation with Stable Diffusion | Part 1
2022-08-28Johnny Cash Delivers The Great Dictator Speech | AI Voice Demo VITS Model with Stable Diffusion art
2022-08-23Duke Nukem covering DJ Shadow feat Run The Jewels' Nobody Speak | AI Voice Synthesis
2022-08-15Turn Low Resolution AI Generated Images into HD Glitch Videos | What if H.R. Giger Designed Minions?
2022-08-13Play Ultima IV IV VII VIII on PS Vita
2022-08-02Play Fallout 2 on the Sony PlayStation Vita with this New Engine Source Port
2022-07-27Voice Cloning Tutorial with Coqui TTS and Google Colab | Fine Tune Your Own VITS Model for Free
2022-07-20What if Bill Gates could rap? VITS voice model covering Deltron 3030's Virus
2022-03-25Moepoofles' bird of wisdom
2022-02-18Powering up a Victor 305n 386SX/20mhz Laptop After Bypassing the Battery
2022-02-02Bird Calls - Happy Budgies Singing in the Afternoon
2022-01-27Let's Make a Portable Arcade and Emulator Machine using a Broken Android Phone
2022-01-21Ultima Underworld II for MS-DOS Intro and Menu Music from Roland SoundCanvas SC-55 MkII MIDI Module
2022-01-21Retroarch Problems? Roms and Overlays Not Loading? Fix File Permissions on Xbox One and Series
2022-01-16Using a Roland SC-55mkII MIDI Module with DOSBox and SCUmmVM in Windows XP and Windows 10
2022-01-09Emulate almost anything with Retroarch on a Retail Xbox One with Shaders and Overlays
2022-01-01Retroarch PS Vita Part 2 - Customized Playlists, Dynamic Backgrounds, Shaders and Video Filters