Uncertainty-Aware Action Advising for Deep Reinforcement Learning Agents | AISC

Channel:

LLMs Explained - Aggregate Intellect - AI.SCIENCE

Subscribers:

22,600

Published on July 17, 2020 5:03:04 AM ● Video Link: https://www.youtube.com/watch?v=LMATIxX3ccI

Duration: 59:25

397 views

For slides and more information on the paper, visit https://ai.science/e/uncertainty-aware-action-advising-for-deep-reinforcement-learning-agents--gvft1hbacsycJX0VCfht

Speaker: Matthew Taylor, Felipe Leno da Silva; Discussion Facilitator: Susan Shu Chang

Motivation:
Although Reinforcement Learning (RL) has been one of the most successful approaches for learning in sequential decision making problems, the sample-complexity of RL techniques still represents a major challenge for practical applications. To combat this challenge, whenever a competent policy (e.g., either a legacy system or a human demonstrator) is available, the agent could leverage samples from this policy (advice) to improve sample-efficiency.

In this work, we propose Requesting Confidence-Moderated Policy advice (RCMP), an action-advising framework where the agent asks for advice when its epistemic uncertainty is high for a certain state. RCMP takes into account that the advice is limited and might be suboptimal. Our empirical evaluations show that RCMP performs better than Importance Advising, not receiving advice, and receiving it at random states in Gridworld and Atari Pong scenarios.

Other Videos By LLMs Explained - Aggregate Intellect - AI.SCIENCE

2020-07-30	Bounded Rationality in Las Vegas: Probabilistic Finite Automata PlayMulti-Armed Bandits \| AISC
2020-07-30	Information Retrieval for Price Consistency Monitoring - Liu Yang (Amazon)
2020-07-29	Quantum Technologies: State of Play \| AISC
2020-07-29	NLP on Noisy User-generated text - NER for StackOverflow \| AISC
2020-07-28	Overview of Machine Learning in Marketing \| AISC
2020-07-24	Cognitive Model Priors for Predicting Human Decisions \| AISC
2020-07-22	Machine Learning to Assess Trends and Alignment of Funded Research Output \| AISC
2020-07-22	COVID and Racial Inequity, and Implications for AI
2020-07-21	TGN: Temporal Graph Networks for Deep Learning on Dynamic Graphs [Paper Explained by the Author]
2020-07-20	Founders in Fundraising, and AI Applications
2020-07-16	Uncertainty-Aware Action Advising for Deep Reinforcement Learning Agents \| AISC
2020-07-16	Machine Learning for Forecasting Global Atmospheric Models \| AISC
2020-07-15	Towards Frequency-Based Explanation for Robust CNN \| AISC
2020-07-14	Lagrangian Neural Networks \| AISC
2020-07-14	TeslaSenti: Near real-time sentiment analysis of Tesla tweets \| Workshop Capstone
2020-07-14	See.Know.Bias - Using AI to Develop Media Literacy and Keep News Neutral \| workshop capstone
2020-07-10	Video Action Transformer Network \| AISC
2020-07-09	DLIME - Let's dig into the code (model explainability stream)
2020-07-09	Overview of Machine Learning in Behavioral Economics \| AISC
2020-07-08	Compact Neural Representation Using Attentive Network Pruning \| AISC
2020-07-08	Navigating the Idea Maze: Continuous discovery frameworks for (AI?) products \| AISC

Channel	Latest
TDK	6 hours ago
Planeta de Los Tutoriales	6 hours ago
ayose peres el ideologo	6 hours ago
Shinyhuntress Alexis	6 hours ago
Prakash S Gaming	6 hours ago
eSDe Toys	7 hours ago
MARKET MINDSET	7 hours ago
POACH	7 hours ago
Rafael Crispim Bezerra	7 hours ago
Marv Gonzales	7 hours ago
BLINDING HOPE LONGPLAY	7 hours ago
Reza Gaming	7 hours ago
Le Monde de Dragthor	7 hours ago
Overwatch DAILY	7 hours ago
Janasena4u	7 hours ago
Saber Kingscrown	7 hours ago
WhizKey	7 hours ago
Tech Technical sk	7 hours ago
PS5 PLANET	7 hours ago
KIRAN GAMING	7 hours ago
Mr. Souer	7 hours ago
Obake PAM Ch.	7 hours ago
Shourize Hobby	7 hours ago
SeriouslyTheCat	7 hours ago
DARK Gaming	8 hours ago