Policy Gradient: Optimal Estimation, Convergence, and Generalization beyond Cumulative Rewards

Channel:

Simons Institute for the Theory of Computing

Subscribers:

68,700

Published on February 25, 2022 12:12:15 PM ● Video Link: https://www.youtube.com/watch?v=gRI4ZyHLGzc

Duration: 0:00

615 views

Mengdi Wang (Princeton University)
https://simons.berkeley.edu/talks/tbd-365
Adversarial Approaches in Machine Learning

Other Videos By Simons Institute for the Theory of Computing

2022-03-11	The Impact of National Service on Beliefs, Mindsets, and Life Pathways...
2022-03-10	Monitoring People and Their Vital Signs Using Radio Signals and Machine Learning
2022-03-08	Causality and Autoencoders in Light of Drug Repurposing for COVID-19
2022-03-05	Thinking Like a Journalist — Science Communicator in Residence Talk
2022-02-26	Adversarial Machine Learning and Instrumental Variables for Flexible Causal Modeling
2022-02-26	Online Adversarial Multicalibration And (Multi)Calibeating
2022-02-26	Rebel: Combining Deep Reinforcement Learning and Search for Imperfect-Information Games
2022-02-26	Generalized Energy-Based Models
2022-02-25	Learning Uninformative Representations
2022-02-25	Of Moments and Matching: Trade-offs and Treatments in Imitation Learning
2022-02-25	Policy Gradient: Optimal Estimation, Convergence, and Generalization beyond Cumulative Rewards
2022-02-25	Are Single-Loop Algorithms Sufficient for Unbalanced Minimax Optimization?
2022-02-25	Zeroth-Order Methods for Convex-Concave Minmax Problems: Learning from Strategically Generated Data
2022-02-25	Halpern Iteration and Equilibria Problems
2022-02-24	Computationally Efficient Alternatives to Nonconvex-Nonconcave Min-Max Optimization
2022-02-16	Balancing Covariates In Randomized Experiments: The Gram--Schmidt Walk Design
2022-02-16	Some Staged Tree Models For Learning From Interventions
2022-02-16	Learning And Testing Causal Models: A Property Testing Viewpoint
2022-02-16	Panel Discussion
2022-02-16	Machine Learning-Based Design Of Proteins
2022-02-15	Searching For Causal Genetic Mechanisms Across Human Populations

Channel	Latest
OPUS ASTORA	6 hours ago
Bring the Asteroid	6 hours ago
Reyju Gaming	6 hours ago
Shravan Srinivasan	7 hours ago
アベレージ / Average Channel	7 hours ago
SUPER SOCCER RUBRO NEGRO -ᄅ-	7 hours ago
The Other Guy	7 hours ago
OMNIxEVIL	7 hours ago
Seer CRZ	7 hours ago
Dreezus	7 hours ago
CANAL JOSÉ MOURA FALANDO FUTEBOL E OUTROS ESPORTES	7 hours ago
DarkXP	8 hours ago
Hanz Meltya Ch.	8 hours ago
Savage Slayer	8 hours ago
Live Stream	8 hours ago
中野あるま / Alma Nakano	8 hours ago
SELECTZ FF	8 hours ago
WolfePack Gaming Den	8 hours ago
jigga876	8 hours ago
Sion Truesilver	8 hours ago
Raging Lumberjack	8 hours ago
Dota2 PERU ez	8 hours ago
BizarreObscure	8 hours ago
savagejesusj	8 hours ago
BlackPearL	8 hours ago