Jukebox: A Generative Model for Music (Paper Explained)

Channel:

Yannic Kilcher

Subscribers:

291,000

Published on May 2, 2020 11:06:26 AM ● Video Link: https://www.youtube.com/watch?v=1aO-uHXbzmQ

Duration: 33:46

21,405 views

696

This generative model for music can make entire songs with remarkable quality and consistency. It can be conditioned on genre, artist, and even lyrics.

Blog: https://openai.com/blog/jukebox/
Paper: https://cdn.openai.com/papers/jukebox.pdf
Code: https://github.com/openai/jukebox/

Abstract:
We introduce Jukebox, a model that generates music with singing in the raw audio domain. We tackle the long context of raw audio using a multiscale VQ-VAE to compress it to discrete codes, and modeling those using autoregressive Transformers. We show that the combined model at scale can generate high-fidelity and diverse songs with coherence up to multiple minutes. We can condition on artist and genre to steer the musical and vocal style, and on unaligned lyrics to make the singing more controllable. We are releasing thousands of non cherry-picked samples, along with model weights and code.

Authors: Prafulla Dhariwal, Heewoo Jun, Christine Payne, Jong Wook Kim, Alec Radford, Ilya Sutskever

Links:
YouTube: https://www.youtube.com/c/yannickilcher
Twitter: https://twitter.com/ykilcher
BitChute: https://www.bitchute.com/channel/yannic-kilcher
Minds: https://www.minds.com/ykilcher

Other Videos By Yannic Kilcher

2020-05-12	Group Normalization (Paper Explained)
2020-05-11	Concept Learning with Energy-Based Models (Paper Explained)
2020-05-10	[News] Google’s medical AI was super accurate in a lab. Real life was a different story.
2020-05-09	Big Transfer (BiT): General Visual Representation Learning (Paper Explained)
2020-05-08	Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning (Paper Explained)
2020-05-07	WHO ARE YOU? 10k Subscribers Special (w/ Channel Analytics)
2020-05-06	Reinforcement Learning with Augmented Data (Paper Explained)
2020-05-05	TAPAS: Weakly Supervised Table Parsing via Pre-training (Paper Explained)
2020-05-04	Chip Placement with Deep Reinforcement Learning (Paper Explained)
2020-05-03	I talk to the new Facebook Blender Chatbot
2020-05-02	Jukebox: A Generative Model for Music (Paper Explained)
2020-05-01	[ML Coding Tips] Separate Computation & Plotting using locals
2020-04-30	The AI Economist: Improving Equality and Productivity with AI-Driven Tax Policies (Paper Explained)
2020-04-29	Deconstructing Lottery Tickets: Zeros, Signs, and the Supermask (Paper Explained)
2020-04-28	[Rant] Online Conferences
2020-04-27	Do ImageNet Classifiers Generalize to ImageNet? (Paper Explained)
2020-04-26	[Drama] Schmidhuber: Critique of Honda Prize for Dr. Hinton
2020-04-25	How much memory does Longformer use?
2020-04-24	Supervised Contrastive Learning
2020-04-23	Thinking While Moving: Deep Reinforcement Learning with Concurrent Control
2020-04-22	[Rant] The Male Only History of Deep Learning

Tags:

deep learning

machine learning

arxiv

explained

neural networks

artificial intelligence

paper

music

vae

vq-vae

latent codes

quantization

sound

lyrics

sinatra

kanye

transformer

openai

Channel	Latest
Skyprince777	13 hours ago
Tsubasa Yozora Ch.	13 hours ago
USIX Pro Gaming	14 hours ago
alanzoka	20 hours ago
AnimeToons	20 hours ago
Flik's Gaming Stuff	20 hours ago
The Mexican Runner	22 hours ago
Beyond the Brick	22 hours ago
Spuffi	22 hours ago
442oons	1 day ago
Nintendo Life	1 day ago
Tamae	1 day ago
IntroGameOver	1 day ago
Dowell	1 day ago
Badaw Gaming	1 day ago
lugeyps3	1 day ago
CarbotAnimations	1 day ago
Pixelorez	1 day ago
Primal Koopa Pictures	1 day ago
BeastBoyShub	1 day ago
816	1 day ago
AoDzTo - อ๊อดโตะ	1 day ago
Chroma	1 day ago
Unnie Cj	1 day ago
Brecy	1 day ago