Reinforcement Learning with Augmented Data (Paper Explained)

Subscribers: 284,000
Video Link: https://www.youtube.com/watch?v=to7vCdkLi4s
Duration: 22:15
Views: 6,573
Likes: 211

This ONE SIMPLE TRICK can take a vanilla RL algorithm to state-of-the-art performance. What is it? Simply augment your training data before feeding it to the learner. This module can be dropped into any RL pipeline and promises big improvements across the board.

Paper: https://arxiv.org/abs/2004.14990
Code: https://www.github.com/MishaLaskin/rad

Abstract:
Learning from visual observations is a fundamental yet challenging problem in reinforcement learning (RL). Although algorithmic advancements combined with convolutional neural networks have proved to be a recipe for success, current methods are still lacking on two fronts: (a) sample efficiency of learning and (b) generalization to new environments. To this end, we present RAD: Reinforcement Learning with Augmented Data, a simple plug-and-play module that can enhance any RL algorithm. We show that data augmentations such as random crop, color jitter, patch cutout, and random convolutions can enable simple RL algorithms to match and even outperform complex state-of-the-art methods across common benchmarks in terms of data-efficiency, generalization, and wall-clock speed. We find that data diversity alone can make agents focus on meaningful information from high-dimensional observations without any changes to the reinforcement learning method. On the DeepMind Control Suite, we show that RAD is state-of-the-art in terms of data-efficiency and performance across 15 environments. We further demonstrate that RAD can significantly improve the test-time generalization on several OpenAI ProcGen benchmarks. Finally, our customized data augmentation modules enable faster wall-clock speed compared to competing RL techniques. Our RAD module and training code are available at this https URL.
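The official RAD repository linked above ships its own optimized augmentation modules; purely as an illustration of the idea, here is a minimal numpy sketch of two of the augmentations the abstract names, random crop and patch cutout, applied to a batch of image observations before the RL update. Function names, shapes, and parameter ranges here are assumptions for the sketch, not the paper's API.

```python
import numpy as np

def random_crop(obs, out_size):
    """Randomly crop each observation in a (B, C, H, W) batch to (out_size, out_size)."""
    b, c, h, w = obs.shape
    tops = np.random.randint(0, h - out_size + 1, size=b)
    lefts = np.random.randint(0, w - out_size + 1, size=b)
    cropped = np.empty((b, c, out_size, out_size), dtype=obs.dtype)
    for i, (t, l) in enumerate(zip(tops, lefts)):
        cropped[i] = obs[i, :, t:t + out_size, l:l + out_size]
    return cropped

def random_cutout(obs, min_cut=4, max_cut=12):
    """Zero out one randomly placed square patch per observation."""
    b, c, h, w = obs.shape
    out = obs.copy()
    for i in range(b):
        size = np.random.randint(min_cut, max_cut + 1)
        t = np.random.randint(0, h - size + 1)
        l = np.random.randint(0, w - size + 1)
        out[i, :, t:t + size, l:l + size] = 0.0
    return out

# Example: augment a batch of 84x84 frames, then hand it to any RL learner.
batch = np.random.rand(8, 3, 84, 84).astype(np.float32)
augmented = random_cutout(random_crop(batch, out_size=76))
print(augmented.shape)  # (8, 3, 76, 76)
```

The key point the paper makes is that nothing else changes: the same augmented batch is fed to an unmodified algorithm such as SAC or PPO, and the data diversity alone drives the gains.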

Authors: Michael Laskin, Kimin Lee, Adam Stooke, Lerrel Pinto, Pieter Abbeel, Aravind Srinivas

Links:
YouTube: https://www.youtube.com/c/yannickilcher
Twitter: https://twitter.com/ykilcher
BitChute: https://www.bitchute.com/channel/yannic-kilcher
Minds: https://www.minds.com/ykilcher




Other Videos By Yannic Kilcher


2020-05-16 [News] Facebook's Real-Time TTS system runs on CPUs only!
2020-05-15 Weight Standardization (Paper Explained)
2020-05-14 [Trash] Automated Inference on Criminality using Face Images
2020-05-13 Faster Neural Network Training with Data Echoing (Paper Explained)
2020-05-12 Group Normalization (Paper Explained)
2020-05-11 Concept Learning with Energy-Based Models (Paper Explained)
2020-05-10 [News] Google’s medical AI was super accurate in a lab. Real life was a different story.
2020-05-09 Big Transfer (BiT): General Visual Representation Learning (Paper Explained)
2020-05-08 Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning (Paper Explained)
2020-05-07 WHO ARE YOU? 10k Subscribers Special (w/ Channel Analytics)
2020-05-06 Reinforcement Learning with Augmented Data (Paper Explained)
2020-05-05 TAPAS: Weakly Supervised Table Parsing via Pre-training (Paper Explained)
2020-05-04 Chip Placement with Deep Reinforcement Learning (Paper Explained)
2020-05-03 I talk to the new Facebook Blender Chatbot
2020-05-02 Jukebox: A Generative Model for Music (Paper Explained)
2020-05-01 [ML Coding Tips] Separate Computation & Plotting using locals
2020-04-30 The AI Economist: Improving Equality and Productivity with AI-Driven Tax Policies (Paper Explained)
2020-04-29 Deconstructing Lottery Tickets: Zeros, Signs, and the Supermask (Paper Explained)
2020-04-28 [Rant] Online Conferences
2020-04-27 Do ImageNet Classifiers Generalize to ImageNet? (Paper Explained)
2020-04-26 [Drama] Schmidhuber: Critique of Honda Prize for Dr. Hinton



Tags:
deep learning
machine learning
arxiv
explained
neural networks
ai
artificial intelligence
paper
rl
reinforcement learning
sac
ppo
deep rl
deep reinforcement learning
dreamer
curl
pixel
pretraining
deepmind
openai
berkeley