CURL: Contrastive Unsupervised Representations for Reinforcement Learning

Subscribers:
284,000
Published on ● Video Link: https://www.youtube.com/watch?v=hg2Q_O5b9w4



Duration: 28:45
11,007 views
361


Contrastive Learning has been an established method in NLP and Image classification. The authors show that with relatively minor adjustments, CL can be used to augment and improve RL dramatically.

Paper: https://arxiv.org/abs/2004.04136
Code: https://github.com/MishaLaskin/curl

Abstract:
We present CURL: Contrastive Unsupervised Representations for Reinforcement Learning. CURL extracts high-level features from raw pixels using contrastive learning and performs off-policy control on top of the extracted features. CURL outperforms prior pixel-based methods, both model-based and model-free, on complex tasks in the DeepMind Control Suite and Atari Games showing 2.8x and 1.6x performance gains respectively at the 100K interaction steps benchmark. On the DeepMind Control Suite, CURL is the first image-based algorithm to nearly match the sample-efficiency and performance of methods that use state-based features.

Authors: Aravind Srinivas, Michael Laskin, Pieter Abbeel

Links:
YouTube: https://www.youtube.com/c/yannickilcher
Twitter: https://twitter.com/ykilcher
BitChute: https://www.bitchute.com/channel/yannic-kilcher
Minds: https://www.minds.com/ykilcher







Tags:
deep learning
machine learning
rl
reinforcement learning
unsupervised
contrast
contrastive
encoder
self-supervised
deep rl
representation
representation learning
query
key