CURL: Contrastive Unsupervised Representations for Reinforcement Learning

Channel:

Yannic Kilcher

Subscribers:

301,000

Published on April 11, 2020 11:04:57 AM ● Video Link: https://www.youtube.com/watch?v=hg2Q_O5b9w4

Duration: 28:45

11,007 views

361

Contrastive learning is an established method in NLP and image classification. The authors show that, with relatively minor adjustments, it can also be used to augment reinforcement learning and improve its performance substantially.

Paper: https://arxiv.org/abs/2004.04136
Code: https://github.com/MishaLaskin/curl

Abstract:
We present CURL: Contrastive Unsupervised Representations for Reinforcement Learning. CURL extracts high-level features from raw pixels using contrastive learning and performs off-policy control on top of the extracted features. CURL outperforms prior pixel-based methods, both model-based and model-free, on complex tasks in the DeepMind Control Suite and Atari Games showing 2.8x and 1.6x performance gains respectively at the 100K interaction steps benchmark. On the DeepMind Control Suite, CURL is the first image-based algorithm to nearly match the sample-efficiency and performance of methods that use state-based features.
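To make the contrastive step in the abstract concrete, here is a minimal NumPy sketch of an InfoNCE-style loss over query/key pairs. It is an illustration under stated assumptions, not the authors' implementation: the bilinear similarity matrix `W`, the function name `info_nce_loss`, and the noisy-copy stand-in for "two augmented views of the same observation" are all hypothetical choices made for this example, and no encoder or RL algorithm is included.

```python
import numpy as np


def info_nce_loss(queries, keys, W):
    """Mean InfoNCE loss where keys[i] is the positive for queries[i].

    queries, keys: arrays of shape (batch, dim); W: array of shape (dim, dim).
    All other keys in the batch act as negatives for a given query.
    """
    # Bilinear similarity between every query and every key: shape (batch, batch).
    logits = queries @ W @ keys.T
    # Subtract the row-wise max for numerical stability (softmax is unchanged).
    logits = logits - logits.max(axis=1, keepdims=True)
    # Log-softmax over the keys for each query.
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    # Positive pairs sit on the diagonal.
    return -np.mean(np.diag(log_probs))


rng = np.random.default_rng(0)
batch, dim = 8, 16
anchors = rng.normal(size=(batch, dim))

# Assumption: small noise stands in for two augmentations of one observation.
queries = anchors + 0.01 * rng.normal(size=(batch, dim))
keys = anchors + 0.01 * rng.normal(size=(batch, dim))
W = np.eye(dim)  # hypothetical: identity in place of a learned matrix

matched = info_nce_loss(queries, keys, W)
# Misaligning the keys makes every "positive" a wrong pair.
shuffled = info_nce_loss(queries, np.roll(keys, 1, axis=0), W)
print(matched, shuffled)
```

The loss is low when each query is most similar to its own key and high when the pairs are misaligned. That is the signal a contrastive objective uses to shape the features on which control is then performed.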

Authors: Aravind Srinivas, Michael Laskin, Pieter Abbeel

Links:
YouTube: https://www.youtube.com/c/yannickilcher
Twitter: https://twitter.com/ykilcher
BitChute: https://www.bitchute.com/channel/yannic-kilcher
Minds: https://www.minds.com/ykilcher

Tags:

deep learning

machine learning

reinforcement learning

unsupervised

contrast

contrastive

encoder

self-supervised

deep rl

representation

representation learning

query

key
