Is YuE the Stable Diffusion of Music? | Generate Full-Length Songs with Vocals at Home

Channel:
Subscribers:
2,970
Published on ● Video Link: https://www.youtube.com/watch?v=hNQA4JxPTn8



Duration: 0:00
493 views
0


Suno and Udio may finally have some open source competition. Here I take a quick look at using the new M-A-P YuE music generation model via the YuE-Extend project. This is a lyrics-to-mixed audio model that can take in annotated lyrics and output full-length, mixed songs.
This supports exllamav2 quantized models and music extension (use the -icl models. I don't cover extension in the video, but the GUI has instructions). This is mostly just an introduction to this handy project by Mozer with plenty of examples of the YuE model being used throughout the video. All music used was generated by M-A-P YuE (mostly the exl2-8 model).

YuE-Extend:
https://github.com/Mozer/YuE-extend

I forgot to note it in the video - if you generate a track you like in the -extend WebUI, save the stems (vocal/instrumental) from the output folder before they are mixed. Theyre overwritten on every generation. The final tracks have timestamped names, but the intermediate tracks will be lost if you dont copy them.

M-A-P YuE Project:
https://map-yue.github.io/

M-A-P YuE Official Code:
https://github.com/multimodal-art-projection/YuE

YuE w/ exllamav2 loading:
https://github.com/sgsdxzy/YuE-exllamav2

GifCities for sick gifs from 1997:
https://gifcities.org/




Other Videos By NanoNomad


2025-08-09Saturday Morning Console Wars: 40 Minutes of Restored Retro Console Commercials
2025-08-06MS-DOS and Windows XP Gaming on a Thinkpad X61 [SoundBlaster Emulation with MIDI in DOS]
2025-07-11Using Flux Kontext in Krita with the Generative AI Plugin
2025-04-17DiffRhythm: Generative Music (done quickly)
2025-02-25Is YuE the Stable Diffusion of Music? | Generate Full-Length Songs with Vocals at Home
2025-02-10Portable Whisper Speech to Text with Speaker Diarization and VAD | Purfview Faster Whisper XXL
2024-07-03Fine Tuning XTTS v2 for Hindi Speech with forked Coqui TTS
2024-06-26Fine Tuning XTTS v2 with forked Coqui | Coqui AI is dead; Long live Coqui!
2024-06-202x Faster LLM Training on Windows | LLaMA-Factory with Unsloth and Flash Attention 2
2024-06-1564kb Scene Demo/Intro/Cracktro Multimedia Mix #1 (90 min) | Flash/Photo-sensitivity Warning
2024-06-10Stable Audio Open 1.0 | Open Source* Generative Audio and Fine Tuning*
2024-06-04Troubleshooting Sega Saturn Emulation with Retroarch for iOS/Apple
2024-05-29Play Windows 98 and MS-DOS Games on iPad/iOS/iPhone with DOSBox-Pure and Retroarch for FREE
2024-05-25The Lost Art of Optical Disc Repair | Fixing and Testing a PlayStation Disc
2024-05-22Retroarch iOS Updates | Improved Performance, MS-DOS Core, Doom and Touch Input
2024-05-17RetroArch for iPad and iPhone now on the App Store | Installation, Setup, Quick Performance Overview
2024-05-13Micca Speck 4K Media Player | Unboxing, Firmware Update, Setup, Demos, and Opinions
2024-05-06Training SDXL to Generate Text Using IA3 LoRA | It's like Kai's Power Tools, I Guess?
2024-04-17Replacing Faulty Asus Phoenix RTX 3060 GPU Cooler - It's Easy
2024-03-21Bark TTS, Seamless Translation, RVC, Music Generation and More with the TTS Generation WebUI
2024-02-14Train Better Stable Diffusion Models | Prep Datasets Using this Free "Magic" Image Tool