When Can We Use Weak Function Approximation to Solve Large Scale Planning Problems in MDPs?

Channel:

Simons Institute for the Theory of Computing

Subscribers:

68,700

Published on September 17, 2022 5:59:52 AM ● Video Link: https://www.youtube.com/watch?v=Y8m21HciOWI

Duration: 51:45

776 views

Csaba Szepesvári (University of Alberta, Google DeepMind)
https://simons.berkeley.edu/talks/tbd-483
Quantifying Uncertainty: Stochastic, Adversarial, and Beyond

At the dawn of the computer age in the 1960s, Bellman and his co-workers already found it useful to use linear function approximation to solve some multistage (or sequential) planning problems. Their approach was simple: Just use function approximation to avoid state-space discretization and thus keep all computation poly-time, while also controlling for accuracy. However, the question of when and how is this possible has eluded researchers for at least 50 years. The partial results obtained suggested that the approximation spaces used need to have an intricate relationship to the problem to be solved and it may not be sufficient that the target function (say, the optimal value function) lies in this space. In this talk I will give an overview of recent work in this area, which essentially closed most of the open questions in the simplest finite horizon setting. While the picture that emerges is interesting, most of the results are negative. The conclusion is that the approximation spaces indeed be better special, or generality needs to be sacrificed in some other way.

Other Videos By Simons Institute for the Theory of Computing

2022-09-27	Common Graphs with Arbitrary Chromatic Number
2022-09-27	Higher-Order Graphon Theory: Fluctuations, Inference, and Degeneracies
2022-09-27	A Large Deviation Principle for Block Models
2022-09-27	Mean-field approximations for high-dimensional Bayesian Regression
2022-09-27	Response of Graphs to Competing Constraints
2022-09-27	Sparse Random Graphs: Interplay of Local and Global Structure
2022-09-26	Analytic Approach to Guasirandomness
2022-09-26	Random Cluster Model on Regular Graphs
2022-09-17	Generalization and Robustness in Offline Reinforcement Learning
2022-09-17	Adaptivity and Confounding in Multi-armed Bandit Experiments
2022-09-16	When Can We Use Weak Function Approximation to Solve Large Scale Planning Problems in MDPs?
2022-09-16	Dynamic Regret Minimization for Bandits without Prior Knowledge
2022-09-16	Oracle-Efficient Online Learning or: How to Use Non-Robust Optimization for Robust Learning
2022-09-16	Adaptive Monopoly Regulation
2022-09-16	Information Collection Through Strategic Agents
2022-09-16	Incentivized Exploration
2022-09-15	Attributes: Selective Learning and Influence
2022-09-15	Markov Persuasion Process and its Reinforcement Learning
2022-09-15	Parsimonious Learning-Augmented Algorithms
2022-09-14	Machine Learning for Faster Optimization
2022-09-14	Flow Time Scheduling with Uncertain Processing Time

Tags:

Simons Institute

theoretical computer science

UC Berkeley

Computer Science

Theory of Computation

Theory of Computing

Quantifying Uncertainty: Stochastic Adversarial and Beyond

Csaba Szepesvári

Channel	Latest
Sey Senpai	8 hours ago
Vardoc1	10 hours ago
Anton Petrov	10 hours ago
Many A True Nerd	11 hours ago
LInk02	11 hours ago
Mon Facts	12 hours ago
GeorgeMallouris	13 hours ago
Big punchman	13 hours ago
Jakou	13 hours ago
HOWTONEVOLUTION	13 hours ago
Brunoborne	13 hours ago
Goodblue77	13 hours ago
lugeyps3	13 hours ago
Stan's Mod Gaming	13 hours ago
OPEN TV	14 hours ago
neXzen MMD & MUSIC	14 hours ago
flipswitch3111	14 hours ago
WalkthroughGuy	14 hours ago
ТРЕНДИ ШОРТС	14 hours ago
eagLe34	14 hours ago
Melody /ميلودي	14 hours ago
Linkwolf	14 hours ago
아루우	14 hours ago
Nostradamus	14 hours ago
Xeres Artrophel Ch.	14 hours ago