Reachability Under Uncertainty & Bayesian Inverse Reinforcement Learning

Subscribers:
344,000
Published on ● Video Link: https://www.youtube.com/watch?v=1eDZlU0Ulzs



Duration: 1:09:07
700 views
4


This talk will present two advances made recently in my group. First, I will introduce a new network reachability problem where the goal is to find the most reliable path between two nodes in a network, represented as a directed acyclic graph. Individual edges within this network may fail according to certain probabilities, and these failure probabilities may depend on the values of one or more hidden variables. I will explain why this problem is harder than similar problems encountered in standard probabilistic inference. I will also an efficient approximation algorithm for this problem, and discuss open issues. The second advance is a generalization of Inverse Reinforcement Learning (IRL). IRL is the problem of learning the reward function underlying a Markov Decision Process given the dynamics of the system and the behaviour of an expert. It is motivated by situations where knowledge of the rewards is a goal by itself (as in preference elicitation) and by the task of apprenticeship learning (learning policies from an expert). In this part of the talk I will show how to combine prior knowledge and evidence from the expert's actions to derive a probability distribution over the space of reward functions. I will present efficient algorithms that find solutions for the reward learning and apprenticeship learning tasks that generalize well over these distributions. Experimental results show strong improvement for this methods over previous heuristic-based approaches. * Joint work with Allen Chang and Deepak Ramachandran (UAI'07; IJCAI'07)




Other Videos By Microsoft Research


2016-09-06In-Car Speech User Interfaces and their Effects on Driving Performance
2016-09-06Microcosm: E. coli and the New Science of Life
2016-09-06Dependable and Sustainable Cyber-Physical Computing - An Overview of IMPACT Lab's Research
2016-09-06General Theorem Proving for Satisfiability Modulo Theories: An Overview
2016-09-06Real-Time Concurrent Garbage Collection
2016-09-06Virtual Earth Summit - Session 2
2016-09-06Concept Lexicon Construction and Affective Analysis: From Photos to MTV
2016-09-06Cloud Computing for e-Science
2016-09-06Thread-saft dynamic binary translation using transactional memory
2016-09-06Scheduling for multi-carrier wireless systems
2016-09-06Reachability Under Uncertainty & Bayesian Inverse Reinforcement Learning
2016-09-06Exploring large social networks with matrix-based representations
2016-09-06Defining and Enforcing Privacy in Data Publishing
2016-09-06Zero Overhead Verification of Software Programs & On Range Search in Distributed Sensor Networks
2016-09-06XNA Game Studio Workshop - Session One
2016-09-06Reconfigurable Computing: Architectural and Design Tool Challenges
2016-09-06Knowledge sharing and awareness in collaborative computing: Experimental research methods
2016-09-06Multimodal Processing of Human Behavior in Intelligent Instrumented Spaces
2016-09-06Energy Based Models: From Relational Regression to Similarity Metric Learning
2016-09-06Multi-layer architectures for secure communication: information theoretic perspectives
2016-09-06Virtual Earth Summit - Session 4



Tags:
microsoft research