Monte Carlo Methods for Bayesian Reinforcement Learning and POMDP

Subscribers: 344,000
Video Link: https://www.youtube.com/watch?v=nxB8aBBMqGk
Duration: 44:23
Views: 3,138

The Partially Observable Markov Decision Process (POMDP) is an elegant and general model for planning under uncertainty. Applications of POMDPs include control of autonomous vehicles, dialog systems, and systems that provide assistance to the elderly. Learning problems such as reinforcement learning, recommendation, and active learning can also be posed as POMDPs. Unfortunately, solving POMDPs is computationally intractable in general. When the state space is not too large, we give conditions under which solving POMDPs becomes computationally easier, and we describe algorithms for solving such problems. We then extend these algorithms to very large or infinite state spaces using Monte Carlo methods.
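The talk itself contains no code, but to make the Monte Carlo idea concrete, below is a minimal Python sketch of a particle-filter belief update and rollout-based action-value estimation on the classic two-state "tiger" POMDP. This is the general flavor of sampling-based machinery that avoids enumerating a very large state space; the problem definition, parameter values, and function names are illustrative assumptions, not the specific algorithms presented in the talk.

```python
import random

# A tiny illustrative POMDP: the classic "tiger" problem.
# States: tiger behind the left or right door. Actions: listen, open a door.
# Observations after listening are noisy hints about the tiger's location.
STATES = ["tiger-left", "tiger-right"]
ACTIONS = ["listen", "open-left", "open-right"]

def transition(state, action):
    """Opening a door resets the problem; listening leaves the state unchanged."""
    if action == "listen":
        return state
    return random.choice(STATES)

def observe(state, action):
    """Listening yields the correct hint 85% of the time; opening is uninformative."""
    if action != "listen":
        return "none"
    correct = "hear-left" if state == "tiger-left" else "hear-right"
    wrong = "hear-right" if state == "tiger-left" else "hear-left"
    return correct if random.random() < 0.85 else wrong

def reward(state, action):
    """Small cost to listen; large penalty for opening the tiger's door."""
    if action == "listen":
        return -1.0
    opened = "tiger-left" if action == "open-left" else "tiger-right"
    return -100.0 if opened == state else 10.0

def particle_filter_update(particles, action, observation, num_particles=1000):
    """Monte Carlo belief update: propagate sampled states through the
    transition model and keep only those consistent with the received
    observation (rejection sampling), instead of enumerating all states."""
    new_particles = []
    while len(new_particles) < num_particles:
        s = random.choice(particles)
        s_next = transition(s, action)
        if observe(s_next, action) == observation:
            new_particles.append(s_next)
    return new_particles

def rollout_value(particles, action, depth=5, num_rollouts=200, gamma=0.95):
    """Estimate the value of taking `action` from the current belief by
    simulating random futures from states sampled out of the particle set."""
    total = 0.0
    for _ in range(num_rollouts):
        s = random.choice(particles)
        a, ret, discount = action, 0.0, 1.0
        for _ in range(depth):
            ret += discount * reward(s, a)
            s = transition(s, a)
            discount *= gamma
            a = random.choice(ACTIONS)
        total += ret
    return total / num_rollouts

if __name__ == "__main__":
    true_state = random.choice(STATES)                      # hidden environment state
    belief = [random.choice(STATES) for _ in range(1000)]   # uniform initial belief
    for step in range(3):
        obs = observe(true_state, "listen")                 # agent listens at the door
        belief = particle_filter_update(belief, "listen", obs)
        values = {a: round(rollout_value(belief, a), 2) for a in ACTIONS}
        print(f"step {step}: obs={obs}, estimated action values={values}")
```

As each noisy observation arrives, the particle set concentrates on the more likely state, and the rollout estimates shift from favoring "listen" toward opening the safer door.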

Tags:
microsoft research