Bark TTS, Seamless Translation, RVC, Music Generation and More with the TTS Generation WebUI

Channel:
Subscribers:
2,260
Published on ● Video Link: https://www.youtube.com/watch?v=Y8J717tr9t0



Duration: 14:48
923 views
33


A look at the TTS Generation Web UI for Bark text to speech, generating music, translation and more. Just a broad overview of what seems to work well, and not so well in this feature-packed project.

[00:00] Topics
[01:20] Hardware/Software Used
[01:55] Installing and Running the WebUI
[02:40] Initial Setup
[02:55] Generating Speech with Bark
[04:10] Cleaning Audio with Demucs, Vocos
[05:10] Voice Cloning with Bark
[07:10] RVC Voice Conversion, Where to Download Models
[08:30] Generative Music with MusicGen
[10:40] Failing to Use Magnet
[11:30] Tortoise TTS
[12:50] Seamless M4T Text to Speech Translation

TTS Generation Webui: https://github.com/rsxdalv/tts-generation-webui

Sources for RVC Models:
https://rvc-models.com/
https://voice-models.com/
https://huggingface.co/spaces/zomehwh/rvc-models/tree/main/weights


If you want to use Tortoise, the MRQ repo is still what I like to use:
https://git.ecker.tech/mrq/ai-voice-cloning
https://www.youtube.com/watch?v=Fzah3eJabOY
https://www.youtube.com/watch?v=o7QRbMvFPzs

Completely unrelated to the video:
ASCII Animator 2.0 https://www.qqpr.com/ for animated ascii .gifs




Other Videos By NanoNomad


4 days agoRetroarch iOS Updates | Improved Performance, MS-DOS Core, Doom and Touch Input
2024-05-17RetroArch for iPad and iPhone now on the App Store | Installation, Setup, Quick Performance Overview
2024-05-13Micca Speck 4K Media Player | Unboxing, Firmware Update, Setup, Demos, and Opinions
2024-05-06Training SDXL to Generate Text Using IA3 LoRA | It's like Kai's Power Tools, I Guess?
2024-04-17Replacing Faulty Asus Phoenix RTX 3060 GPU Cooler - It's Easy
2024-03-21Bark TTS, Seamless Translation, RVC, Music Generation and More with the TTS Generation WebUI
2024-02-14Train Better Stable Diffusion Models | Prep Datasets Using this Free "Magic" Image Tool
2024-02-12Emulate a Sound Blaster in real MS-DOS on Modern Hardware | Retro Gaming on "Current" PCs
2024-01-28How to Play Hundreds of Point-and-Click Adventures on iOS for FREE with ScummVM with NO SIDELOADING
2024-01-18Training LoRAs and GLoRAs for Stable Diffusion 1.5 and XL Using the New Prodigy Optimizer
2024-01-03Nick Rekieta - Role Model (Voice Parody. It's silly. It's a joke.)
2023-11-19Automated Image Captioning with LLMs - Recognize Anything, BLIP-2, and Kosmos-2
2023-10-27Fine-Tuning Mistral 7B using QLoRA and PEFT on Unstructured Scraped Text Data | Making it Evil?
2023-09-20Exploring XTTS v1 and Tools to make Better Audio Datasets (the lazy way)
2023-09-01Es spricht Deutsch | Tortoise TTS Speaking German Demo Clip | Model Download Link in Description
2023-08-18AI Null reads Alice's Adventures in Wonderland by Lewis Carroll | Full Audiobook
2023-08-11Remove Background Music and Enhance Speech with Free AI Tools | Avoid ContentID
2023-08-06AI Null Reads Alice's Adventures in Wonderland by Lewis Carroll, Chapters 1 and 2 | joshcore
2023-07-30Are Text Cleaners Making Your TTS Models Sound Bad? | TTS Model Training Tips
2023-07-08.:Demo:. Tortoise TTS Expressive Speech narrating Norman Arkawy's 1955 Sci-Fi short "Selling Point"
2023-07-03.::Demo::. 4 Voice Multispeaker Tortoise TTS English Fine-Tuned Model Test :: Great Dictator Speech



Tags:
TTS
Bark TTS
MusicGen
RVC
Machine Translation
AI Models