Regularizing Trajectory Optimization with Denoising Autoencoders (Paper Explained)

Channel:

Yannic Kilcher

Subscribers:

291,000

Published on May 24, 2020 3:31:04 PM ● Video Link: https://www.youtube.com/watch?v=UjJU13GdL94

Duration: 29:58

5,151 views

188

Can you plan with a learned model of the world? Yes, but there's a catch: The better your planning algorithm is, the more the errors of your world model will hurt you! This paper solves this problem by regularizing the planning algorithm to stay in high probability regions, given its experience.

https://arxiv.org/abs/1903.11981

Interview w/ Harri: https://youtu.be/HnZDmxYnpg4

Abstract:
Trajectory optimization using a learned model of the environment is one of the core elements of model-based reinforcement learning. This procedure often suffers from exploiting inaccuracies of the learned model. We propose to regularize trajectory optimization by means of a denoising autoencoder that is trained on the same trajectories as the model of the environment. We show that the proposed regularization leads to improved planning with both gradient-based and gradient-free optimizers. We also demonstrate that using regularized trajectory optimization leads to rapid initial learning in a set of popular motor control tasks, which suggests that the proposed approach can be a useful tool for improving sample efficiency.

Authors: Rinu Boney, Norman Di Palo, Mathias Berglund, Alexander Ilin, Juho Kannala, Antti Rasmus, Harri Valpola

Links:
YouTube: https://www.youtube.com/c/yannickilcher
Twitter: https://twitter.com/ykilcher
BitChute: https://www.bitchute.com/channel/yannic-kilcher
Minds: https://www.minds.com/ykilcher

Other Videos By Yannic Kilcher

2020-06-03	Learning To Classify Images Without Labels (Paper Explained)
2020-06-02	On the Measure of Intelligence by François Chollet - Part 1: Foundations (Paper Explained)
2020-06-01	Dynamics-Aware Unsupervised Discovery of Skills (Paper Explained)
2020-05-31	Synthesizer: Rethinking Self-Attention in Transformer Models (Paper Explained)
2020-05-30	[Code] How to use Facebook's DETR object detection algorithm in Python (Full Tutorial)
2020-05-29	GPT-3: Language Models are Few-Shot Learners (Paper Explained)
2020-05-28	DETR: End-to-End Object Detection with Transformers (Paper Explained)
2020-05-27	mixup: Beyond Empirical Risk Minimization (Paper Explained)
2020-05-26	A critical analysis of self-supervision, or what we can learn from a single image (Paper Explained)
2020-05-25	Deep image reconstruction from human brain activity (Paper Explained)
2020-05-24	Regularizing Trajectory Optimization with Denoising Autoencoders (Paper Explained)
2020-05-23	[News] The NeurIPS Broader Impact Statement
2020-05-22	When BERT Plays the Lottery, All Tickets Are Winning (Paper Explained)
2020-05-21	[News] OpenAI Model Generates Python Code
2020-05-20	Investigating Human Priors for Playing Video Games (Paper & Demo)
2020-05-19	iMAML: Meta-Learning with Implicit Gradients (Paper Explained)
2020-05-18	[Code] PyTorch sentiment classifier from scratch with Huggingface NLP Library (Full Tutorial)
2020-05-17	Planning to Explore via Self-Supervised World Models (Paper Explained)
2020-05-16	[News] Facebook's Real-Time TTS system runs on CPUs only!
2020-05-15	Weight Standardization (Paper Explained)
2020-05-14	[Trash] Automated Inference on Criminality using Face Images

Tags:

deep learning

machine learning

arxiv

explained

neural networks

artificial intelligence

paper

reinforcement learning

model predictive control

dae

denoising autoencoders

trajectory

trajectory optimization

planning

adversarial attack

errors

open loop

closed loop

joint

probability

derivative

gaussian

experience

learned model

world model

model predictive

mpc

Channel	Latest
alanzoka	10 hours ago
Beyond the Brick	12 hours ago
Nintendo Life	14 hours ago
IntroGameOver	15 hours ago
lugeyps3	16 hours ago
CarbotAnimations	17 hours ago
Pixelorez	17 hours ago
Primal Koopa Pictures	17 hours ago
BeastBoyShub	17 hours ago
Chroma	18 hours ago
Unnie Cj	18 hours ago
Brecy	19 hours ago
Renzuwu	19 hours ago
Fal Oval	19 hours ago
fadd game	19 hours ago
Aezwozere	19 hours ago
눈사람	19 hours ago
Fragilistic	19 hours ago
akitokid 青色夜想曲	19 hours ago
soydianagames	19 hours ago
상상상상	19 hours ago
Lucivius	19 hours ago
Ruckquez Nd Stuff	19 hours ago
野武士ノディー	19 hours ago
fan komar	19 hours ago