Reachability Under Uncertainty & Bayesian Inverse Reinforcement Learning

Channel:

Subscribers:

351,000

Published on September 6, 2016 5:21:01 PM ● Video Link: https://www.youtube.com/watch?v=1eDZlU0Ulzs

Duration: 1:09:07

700 views

This talk will present two advances made recently in my group. First, I will introduce a new network reachability problem where the goal is to find the most reliable path between two nodes in a network, represented as a directed acyclic graph. Individual edges within this network may fail according to certain probabilities, and these failure probabilities may depend on the values of one or more hidden variables. I will explain why this problem is harder than similar problems encountered in standard probabilistic inference. I will also an efficient approximation algorithm for this problem, and discuss open issues. The second advance is a generalization of Inverse Reinforcement Learning (IRL). IRL is the problem of learning the reward function underlying a Markov Decision Process given the dynamics of the system and the behaviour of an expert. It is motivated by situations where knowledge of the rewards is a goal by itself (as in preference elicitation) and by the task of apprenticeship learning (learning policies from an expert). In this part of the talk I will show how to combine prior knowledge and evidence from the expert's actions to derive a probability distribution over the space of reward functions. I will present efficient algorithms that find solutions for the reward learning and apprenticeship learning tasks that generalize well over these distributions. Experimental results show strong improvement for this methods over previous heuristic-based approaches. * Joint work with Allen Chang and Deepak Ramachandran (UAI'07; IJCAI'07)

Other Videos By Microsoft Research

2016-09-06	In-Car Speech User Interfaces and their Effects on Driving Performance
2016-09-06	Microcosm: E. coli and the New Science of Life
2016-09-06	Dependable and Sustainable Cyber-Physical Computing - An Overview of IMPACT Lab's Research
2016-09-06	General Theorem Proving for Satisfiability Modulo Theories: An Overview
2016-09-06	Real-Time Concurrent Garbage Collection
2016-09-06	Virtual Earth Summit - Session 2
2016-09-06	Concept Lexicon Construction and Affective Analysis: From Photos to MTV
2016-09-06	Cloud Computing for e-Science
2016-09-06	Thread-saft dynamic binary translation using transactional memory
2016-09-06	Scheduling for multi-carrier wireless systems
2016-09-06	Reachability Under Uncertainty & Bayesian Inverse Reinforcement Learning
2016-09-06	Exploring large social networks with matrix-based representations
2016-09-06	Defining and Enforcing Privacy in Data Publishing
2016-09-06	Zero Overhead Verification of Software Programs & On Range Search in Distributed Sensor Networks
2016-09-06	XNA Game Studio Workshop - Session One
2016-09-06	Reconfigurable Computing: Architectural and Design Tool Challenges
2016-09-06	Knowledge sharing and awareness in collaborative computing: Experimental research methods
2016-09-06	Multimodal Processing of Human Behavior in Intelligent Instrumented Spaces
2016-09-06	Energy Based Models: From Relational Regression to Similarity Metric Learning
2016-09-06	Multi-layer architectures for secure communication: information theoretic perspectives
2016-09-06	Virtual Earth Summit - Session 4

Tags:

microsoft research

Channel	Latest
Porygon do Caos	6 hours ago
【公認】ドズぼん・ザ・クリップ【ドズル社切り抜き】	6 hours ago
Munam Aslam	6 hours ago
VIRUS GAMING	6 hours ago
Superb Game	6 hours ago
BoraLo	6 hours ago
Julius PRESET 379 rb x ditonton • 5 jam yang lalu	6 hours ago
Studen Albatroz	6 hours ago
CallMeSam	6 hours ago
Silverhawk	6 hours ago
GAMErHyNas	6 hours ago
Cloudie McGaming	6 hours ago
BritFlicks \| Indie Film Trailers Worldwide	6 hours ago
ChessBase India	6 hours ago
EvGeN Channel	6 hours ago
MG Surprise Toys	6 hours ago
Gaming Raju	6 hours ago
MAVOmusic	6 hours ago
MangoxLoco	6 hours ago
Microboy	6 hours ago
egboj20	6 hours ago
TMossBoss	7 hours ago
Jwnwm Basumatary	7 hours ago
Adjie Cahyono	7 hours ago
MICRO.	7 hours ago