Fine Tuning XTTS v2 for Hindi Speech with forked Coqui TTS VIDEO
I heard a few people say that the base XTTSv2 2.0.3 model doesn't produce very good Hindi output. Here I go through preparing some Hindi speech datasets using the Common Voice version 18 and Indic TTS Database datasets. I put together a handful of metadata conversion scripts, cleanup scripts, and batch files for this.
You can copy and paste them from here:
http://nanonomad.com/2024/07/03/xttsv2-hindi-finetuning/
The fine-tuned XTTSv2 checkpoints and speaker reference clips are here:
https://huggingface.co/AOLCDROM/XTTSv2-Hi_ft/tree/main
Note: The highest step count may not necessarily be the best quality output for whatever speaker you are trying to use. Some checkpoints may repeat the end of sentences more than others.
Previous XTTS v2 video: • Fine Tuning XTTS v2 with forked Coqui...
Other Videos By NanoNomad 2025-08-09 Saturday Morning Console Wars: 40 Minutes of Restored Retro Console Commercials 2025-08-06 MS-DOS and Windows XP Gaming on a Thinkpad X61 [SoundBlaster Emulation with MIDI in DOS] 2025-07-11 Using Flux Kontext in Krita with the Generative AI Plugin 2025-04-17 DiffRhythm: Generative Music (done quickly) 2025-02-25 Is YuE the Stable Diffusion of Music? | Generate Full-Length Songs with Vocals at Home 2025-02-10 Portable Whisper Speech to Text with Speaker Diarization and VAD | Purfview Faster Whisper XXL 2024-07-03 Fine Tuning XTTS v2 for Hindi Speech with forked Coqui TTS 2024-06-26 Fine Tuning XTTS v2 with forked Coqui | Coqui AI is dead; Long live Coqui! 2024-06-20 2x Faster LLM Training on Windows | LLaMA-Factory with Unsloth and Flash Attention 2 2024-06-15 64kb Scene Demo/Intro/Cracktro Multimedia Mix #1 (90 min) | Flash/Photo-sensitivity Warning 2024-06-10 Stable Audio Open 1.0 | Open Source* Generative Audio and Fine Tuning* 2024-06-04 Troubleshooting Sega Saturn Emulation with Retroarch for iOS/Apple 2024-05-29 Play Windows 98 and MS-DOS Games on iPad/iOS/iPhone with DOSBox-Pure and Retroarch for FREE 2024-05-25 The Lost Art of Optical Disc Repair | Fixing and Testing a PlayStation Disc 2024-05-22 Retroarch iOS Updates | Improved Performance, MS-DOS Core, Doom and Touch Input 2024-05-17 RetroArch for iPad and iPhone now on the App Store | Installation, Setup, Quick Performance Overview 2024-05-13 Micca Speck 4K Media Player | Unboxing, Firmware Update, Setup, Demos, and Opinions 2024-05-06 Training SDXL to Generate Text Using IA3 LoRA | It's like Kai's Power Tools, I Guess? 2024-04-17 Replacing Faulty Asus Phoenix RTX 3060 GPU Cooler - It's Easy 2024-03-21 Bark TTS, Seamless Translation, RVC, Music Generation and More with the TTS Generation WebUI 2024-02-14 Train Better Stable Diffusion Models | Prep Datasets Using this Free "Magic" Image Tool
Tags: text to speech
tts
xtts
coqui
ai