Is YuE the Stable Diffusion of Music? | Generate Full-Length Songs with Vocals at Home

Channel:

NanoNomad

Subscribers:

2,970

Published on February 25, 2025 9:49:53 AM ● Video Link: https://www.youtube.com/watch?v=hNQA4JxPTn8

Duration: 0:00

493 views

Suno and Udio may finally have some open source competition. Here I take a quick look at using the new M-A-P YuE music generation model via the YuE-Extend project. This is a lyrics-to-mixed audio model that can take in annotated lyrics and output full-length, mixed songs.
This supports exllamav2 quantized models and music extension (use the -icl models. I don't cover extension in the video, but the GUI has instructions). This is mostly just an introduction to this handy project by Mozer with plenty of examples of the YuE model being used throughout the video. All music used was generated by M-A-P YuE (mostly the exl2-8 model).

YuE-Extend:
https://github.com/Mozer/YuE-extend

I forgot to note it in the video - if you generate a track you like in the -extend WebUI, save the stems (vocal/instrumental) from the output folder before they are mixed. Theyre overwritten on every generation. The final tracks have timestamped names, but the intermediate tracks will be lost if you dont copy them.

M-A-P YuE Project:
https://map-yue.github.io/

M-A-P YuE Official Code:
https://github.com/multimodal-art-projection/YuE

YuE w/ exllamav2 loading:
https://github.com/sgsdxzy/YuE-exllamav2

GifCities for sick gifs from 1997:
https://gifcities.org/

Other Videos By NanoNomad

2025-08-09	Saturday Morning Console Wars: 40 Minutes of Restored Retro Console Commercials
2025-08-06	MS-DOS and Windows XP Gaming on a Thinkpad X61 [SoundBlaster Emulation with MIDI in DOS]
2025-07-11	Using Flux Kontext in Krita with the Generative AI Plugin
2025-04-17	DiffRhythm: Generative Music (done quickly)
2025-02-25	Is YuE the Stable Diffusion of Music? \| Generate Full-Length Songs with Vocals at Home
2025-02-10	Portable Whisper Speech to Text with Speaker Diarization and VAD \| Purfview Faster Whisper XXL
2024-07-03	Fine Tuning XTTS v2 for Hindi Speech with forked Coqui TTS
2024-06-26	Fine Tuning XTTS v2 with forked Coqui \| Coqui AI is dead; Long live Coqui!
2024-06-20	2x Faster LLM Training on Windows \| LLaMA-Factory with Unsloth and Flash Attention 2
2024-06-15	64kb Scene Demo/Intro/Cracktro Multimedia Mix #1 (90 min) \| Flash/Photo-sensitivity Warning
2024-06-10	Stable Audio Open 1.0 \| Open Source* Generative Audio and Fine Tuning*
2024-06-04	Troubleshooting Sega Saturn Emulation with Retroarch for iOS/Apple
2024-05-29	Play Windows 98 and MS-DOS Games on iPad/iOS/iPhone with DOSBox-Pure and Retroarch for FREE
2024-05-25	The Lost Art of Optical Disc Repair \| Fixing and Testing a PlayStation Disc
2024-05-22	Retroarch iOS Updates \| Improved Performance, MS-DOS Core, Doom and Touch Input
2024-05-17	RetroArch for iPad and iPhone now on the App Store \| Installation, Setup, Quick Performance Overview
2024-05-13	Micca Speck 4K Media Player \| Unboxing, Firmware Update, Setup, Demos, and Opinions
2024-05-06	Training SDXL to Generate Text Using IA3 LoRA \| It's like Kai's Power Tools, I Guess?
2024-04-17	Replacing Faulty Asus Phoenix RTX 3060 GPU Cooler - It's Easy
2024-03-21	Bark TTS, Seamless Translation, RVC, Music Generation and More with the TTS Generation WebUI
2024-02-14	Train Better Stable Diffusion Models \| Prep Datasets Using this Free "Magic" Image Tool

Channel	Latest
Kamar Rama	6 hours ago
USIX Pro Gaming	6 hours ago
AnimeToons	8 hours ago
AngryJoeShow	9 hours ago
Skyprince777	10 hours ago
Nintendo of America	10 hours ago
Anton Petrov	11 hours ago
PopCross Studios	12 hours ago
alanzoka	12 hours ago
Aaronitmar	13 hours ago
IGN	13 hours ago
Kage848	13 hours ago
CHAQN2	13 hours ago
JoBlo Animated Videos	14 hours ago
Chroma	14 hours ago
Goodblue77	14 hours ago
ZGadgetReview	14 hours ago
Skurry	15 hours ago
Pecel Boy	15 hours ago
woclips	15 hours ago
DENZ TVLOG	15 hours ago
JMGames	15 hours ago
Syaoran	15 hours ago
Ney Games	15 hours ago
Nostradamus	15 hours ago