Learning World Graphs to Accelerate Hierarchical Reinforcement Learning

Subscribers: 284,000
Video Link: https://www.youtube.com/watch?v=Qk4lJdp7ZAs
Duration: 18:39
Views: 3,111


The goal of hierarchical reinforcement learning is to decompose a task into levels of coarseness: the top-level agent plans only over a high-level view of the world, while each subsequent layer operates on a progressively more detailed view. This paper proposes learning a set of important states, together with the connections between them, as the high-level abstraction.
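To make the idea concrete, here is a minimal sketch of such a world graph: nodes are pivotal states (toy grid coordinates here), edges mark feasible traversals, and high-level planning reduces to a shortest-path search over the graph. The layout, names, and BFS planner are illustrative stand-ins, not the paper's learned components.

```python
from collections import deque

# Hypothetical world graph for a small maze: nodes are pivotal states,
# edges are feasible traversals discovered by a low-level policy.
world_graph = {
    (0, 0): [(0, 2)],
    (0, 2): [(0, 0), (2, 2)],
    (2, 2): [(0, 2), (4, 2)],
    (4, 2): [(2, 2)],
}

def plan_over_graph(graph, start, goal):
    """BFS over pivotal states: the high-level plan is a short sequence of
    pivotal states, leaving fine-grained navigation to a low-level policy."""
    frontier = deque([[start]])
    visited = {start}
    while frontier:
        path = frontier.popleft()
        if path[-1] == goal:
            return path
        for nxt in graph[path[-1]]:
            if nxt not in visited:
                visited.add(nxt)
                frontier.append(path + [nxt])
    return None

print(plan_over_graph(world_graph, (0, 0), (4, 2)))
# → [(0, 0), (0, 2), (2, 2), (4, 2)]
```

Because the graph is tiny compared to the raw state space, planning at this level is cheap even when the underlying maze is large.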

https://arxiv.org/abs/1907.00664

Abstract:
In many real-world scenarios, an autonomous agent often encounters various tasks within a single complex environment. We propose to build a graph abstraction over the environment structure to accelerate the learning of these tasks. Here, nodes are important points of interest (pivotal states) and edges represent feasible traversals between them. Our approach has two stages. First, we jointly train a latent pivotal state model and a curiosity-driven goal-conditioned policy in a task-agnostic manner. Second, provided with the information from the world graph, a high-level Manager quickly finds solutions to new tasks and expresses subgoals in reference to pivotal states to a low-level Worker. The Worker can then also leverage the graph to easily traverse to the pivotal states of interest, even across long distances, and explore non-locally. We perform a thorough ablation study to evaluate our approach on a suite of challenging maze tasks, demonstrating significant performance and efficiency advantages of the proposed framework over baselines that lack world graph knowledge.
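The Manager/Worker split described in the abstract can be sketched as follows: the Manager emits pivotal states along a graph path as subgoals, and the Worker executes each graph edge. In this toy version the Worker replays cached edge trajectories; in the actual framework both levels are learned policies, so every name and data structure below is an illustrative assumption.

```python
# Assumed: low-level action sequences cached per graph edge, standing in
# for the goal-conditioned Worker policy trained in stage one.
edge_trajectories = {
    ("A", "B"): ["right", "right"],
    ("B", "C"): ["down", "down", "right"],
}

def manager(graph_path):
    """Yield the next pivotal state on the path as a subgoal for the Worker."""
    for subgoal in graph_path[1:]:
        yield subgoal

def worker(current, subgoal):
    """Replay the cached edge traversal; a learned goal-conditioned policy
    would produce these actions in the real framework."""
    return edge_trajectories[(current, subgoal)]

actions, current = [], "A"
for subgoal in manager(["A", "B", "C"]):
    actions += worker(current, subgoal)
    current = subgoal
print(actions)
# → ['right', 'right', 'down', 'down', 'right']
```

The point of the hierarchy is visible even in this toy: the Manager reasons over two graph hops, while the Worker handles the five primitive actions those hops expand into.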

Authors: Wenling Shang, Alex Trott, Stephan Zheng, Caiming Xiong, Richard Socher


Tags:
deep learning
reinforcement learning
deep reinforcement learning
world model
hierarchical reinforcement learning
planning
salesforce
research
machine learning
navigation
pivot states
ai
artificial intelligence