Create Datasets for Voice Model Training on Google Colab | Updated Tools for Coqui TTS Training

Channel: NanoNomad
Subscribers: 2,340
Published on: 2023-03-29
Video Link: https://www.youtube.com/watch?v=196h4JsqmZc

Category: Vlog
Duration: 12:08
Views: 3,521


I've updated the Google Colab notebook that I use for making datasets. There are quite a few changes: rnnoise is back, Demucs has been added, audio normalization has changed, speaker diarization and segmentation have been added, and long files can now be segmented with ffmpeg. In this video I go over the changes to the dataset tools, then take some audio files, process them, and run them through Whisper.
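
As a rough illustration of the long-file segmentation step mentioned above, here is a minimal Python sketch that builds an ffmpeg command using ffmpeg's segment muxer. It is not taken from the notebook: the helper name `build_segment_command`, the file names, and the 600-second chunk length are all assumptions made for the example.

```python
import shlex
from pathlib import Path


def build_segment_command(src, out_dir, segment_seconds=600):
    """Build an ffmpeg command that cuts `src` into fixed-length chunks.

    Hypothetical helper for illustration; the notebook may do this differently.
    """
    src = Path(src)
    # ffmpeg expands %03d to 000, 001, 002, ... for each output chunk.
    out_pattern = Path(out_dir) / f"{src.stem}_%03d.wav"
    return [
        "ffmpeg",
        "-i", str(src),
        "-f", "segment",                         # use the segment muxer
        "-segment_time", str(segment_seconds),   # target chunk length in seconds
        "-c", "copy",                            # copy the audio stream, no re-encode
        str(out_pattern),
    ]


# Example with assumed file names; prints the command instead of running it.
cmd = build_segment_command("interview.wav", "chunks", segment_seconds=600)
print(shlex.join(cmd))
```

To actually split a file, the command list could be passed to `subprocess.run(cmd, check=True)` on a machine where ffmpeg is installed and the output directory already exists.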

Updated dataset tools notebook with Coqui VITS model training:
https://colab.research.google.com/drive/1G54TjJjnzvA1Pc0IpPZoKVBS4N4G_k4T?usp=sharing

I go over training in some of the other videos:
https://www.youtube.com/watch?v=8v18u8PQXgs




Other Videos By NanoNomad


2023-05-16 Tortoise TTS DEMO: G-Man performs Gilbert and Sullivan's 'The Major-General's Song'
2023-05-15 Train Tortoise TTS in English, Spanish, French, Italian, Portuguese, German, and more? Maybe?
2023-05-10 DEMO: Testing French-Speaking Tortoise TTS
2023-05-10 DEMO: Testing German-Speaking Tortoise TTS
2023-05-08 DEMO: Testing Spanish Speaking Tortoise TTS
2023-05-07 DEMO: Testing Tortoise TTS Speaking in Portuguese
2023-05-04 Make Using Tortoise TTS Faster with Fine-Tuned Models
2023-05-01 AI Voice Swap and Lip Sync using Wav2Lip-HQ-Updated
2023-04-22 Voice Cloning with Tortoise TTS and Model Training Using the AI Voice Cloning WebUI
2023-04-07 Locally Hosted Chatbots with RWKV through ChatRWKV and the Text-Generation-WebUI | 14B Model on 3GB!
2023-03-29 Create Datasets for Voice Model Training on Google Colab | Updated Tools for Coqui TTS Training
2023-03-22 Train a VITS Speech Model using Coqui TTS | Updated Script and Audio Processing Tools
2023-03-15 Training or Fine Tuning a Hindi Language VITS TTS Voice Model with Coqui TTS on Google Colab
2023-03-05 Install and Configure Retroarch for PS Vita with Thumbnails, Overlays and Shaders
2023-03-03 Fallout 1 on the PS Vita is the Best Way to Play
2023-02-24 Train or Fine Tune VITS on (theoretically) Any Language | Train Multi-Speaker Model | Train YourTTS
2023-02-12 Even more Voice Cloning | Train a Multi-Speaker VITS model using Google Colab and a Custom Dataset
2023-02-04 Updated | Near-Automated Voice Cloning | Whisper STT + Coqui TTS | Fine Tune a VITS Model on Colab
2023-01-30 YourTTS Training Discussion | Experiences, Multistage Training, Demos, Prior Training Preservation
2023-01-27 Updated | Fine-Tuning YourTTS with Automated STT Datasets on Google Colab for AI Voice Cloning
2023-01-13 Fine-Tune YourTTS with Near-Automated Datasets on Google Colab for AI Voice Cloning



Tags:
text to speech
coqui tts
ai voice
machine learning
artificial intelligence
audio dataset
demucs