Uncertainty-Aware Action Advising for Deep Reinforcement Learning Agents | AISC

Published on ● Video Link: https://www.youtube.com/watch?v=LMATIxX3ccI



Duration: 59:25
397 views
19


For slides and more information on the paper, visit https://ai.science/e/uncertainty-aware-action-advising-for-deep-reinforcement-learning-agents--gvft1hbacsycJX0VCfht

Speaker: Matthew Taylor, Felipe Leno da Silva; Discussion Facilitator: Susan Shu Chang

Motivation:
Although Reinforcement Learning (RL) has been one of the most successful approaches for learning in sequential decision making problems, the sample-complexity of RL techniques still represents a major challenge for practical applications. To combat this challenge, whenever a competent policy (e.g., either a legacy system or a human demonstrator) is available, the agent could leverage samples from this policy (advice) to improve sample-efficiency.

In this work, we propose Requesting Confidence-Moderated Policy advice (RCMP), an action-advising framework where the agent asks for advice when its epistemic uncertainty is high for a certain state. RCMP takes into account that the advice is limited and might be suboptimal. Our empirical evaluations show that RCMP performs better than Importance Advising, not receiving advice, and receiving it at random states in Gridworld and Atari Pong scenarios.




Other Videos By LLMs Explained - Aggregate Intellect - AI.SCIENCE


2020-07-30Bounded Rationality in Las Vegas: Probabilistic Finite Automata PlayMulti-Armed Bandits | AISC
2020-07-30Information Retrieval for Price Consistency Monitoring - Liu Yang (Amazon)
2020-07-29Quantum Technologies: State of Play | AISC
2020-07-29NLP on Noisy User-generated text - NER for StackOverflow | AISC
2020-07-28Overview of Machine Learning in Marketing | AISC
2020-07-24Cognitive Model Priors for Predicting Human Decisions | AISC
2020-07-22Machine Learning to Assess Trends and Alignment of Funded Research Output | AISC
2020-07-22COVID and Racial Inequity, and Implications for AI
2020-07-21TGN: Temporal Graph Networks for Deep Learning on Dynamic Graphs [Paper Explained by the Author]
2020-07-20Founders in Fundraising, and AI Applications
2020-07-16Uncertainty-Aware Action Advising for Deep Reinforcement Learning Agents | AISC
2020-07-16Machine Learning for Forecasting Global Atmospheric Models | AISC
2020-07-15Towards Frequency-Based Explanation for Robust CNN | AISC
2020-07-14Lagrangian Neural Networks | AISC
2020-07-14TeslaSenti: Near real-time sentiment analysis of Tesla tweets | Workshop Capstone
2020-07-14See.Know.Bias - Using AI to Develop Media Literacy and Keep News Neutral | workshop capstone
2020-07-10Video Action Transformer Network | AISC
2020-07-09DLIME - Let's dig into the code (model explainability stream)
2020-07-09Overview of Machine Learning in Behavioral Economics | AISC
2020-07-08Compact Neural Representation Using Attentive Network Pruning | AISC
2020-07-08Navigating the Idea Maze: Continuous discovery frameworks for (AI?) products | AISC