Foundations of Real-World Reinforcement Learning

Channel:

Subscribers:

351,000

Published on May 27, 2021 8:22:15 PM ● Video Link: https://www.youtube.com/watch?v=A6wNJ4-MpIg

Duration: 1:23:22

3,563 views

Reinforcement learning (RL) is an approach to sequential decision making under uncertainty which formalizes the principles for designing an autonomous learning agent. The broad goal of a reinforcement learning agent is to find an optimal policy which maximizes its long-term rewards over time. Its list of applications is growing as the technology advances and continues to be further integrated into many areas, such as education, health, advertising, autonomous systems, and gaming.

By starting from the perspective of an agent which interacts with and affects its environment, RL provides an improvement upon supervised learning in situations requiring decisions, and not just predictions. In particular, it motivates exploratory actions to discover novel rewarding behavior in the environment, a hallmark of intelligent agents.

In this webinar—led by Microsoft Researchers John Langford, Partner Research Manager with over a decade of experience in reinforcement learning-related research, and Alekh Agarwal, Principal Research Manager and leader of the Reinforcement Learning group in Redmond—learn how RL works to impact real-world problems across a variety of domains.

Together, you'll explore:

■ The definition and uses of RL, from a general paradigm to its broad range of applications
■ The various benefits of using RL as well as its current challenges
■ The specific types of RL—contextual bandits, imitation learning, and strategic exploration
■ Where these cutting-edge methods might take the future of RL.

𝗥𝗲𝘀𝗼𝘂𝗿𝗰𝗲 𝗹𝗶𝘀𝘁:

■ Reinforcement learning for the real world with Dr. John Langford and Rafah Hosn (Podcast) - https://www.microsoft.com/en-us/research/podcast/reinforcement-learning-for-the-real-world-with-dr-john-langford-and-rafah-hosn/
■ Real World Reinforcement Learning (Project page) - https://www.microsoft.com/en-us/research/project/real-world-reinforcement-learning/
■ Kinematic State Abstraction and Provably Efficient Rich-Observation Reinforcement Learning (Publication) - https://www.microsoft.com/en-us/research/publication/kinematic-state-abstraction-and-provably-efficient-rich-observation-reinforcement-learning/
■ Provably efficient reinforcement learning with rich observations (Blog) - https://www.microsoft.com/en-us/research/blog/provably-efficient-reinforcement-learning-with-rich-observations/
■ ICML 2017 Tutorial on Real World Interactive Learning (Tutorial) - https://hunch.net/~rwil/
■ Machine Learning (Theory) (John Langford’s blog) - https://hunch.net/
■ Vowpal Wabbit (open source project) - https://vowpalwabbit.org/
■ Reinforcement Learning (Career opportunities) - https://www.microsoft.com/en-us/research/theme/reinforcement-learning-group/#!opportunities
■ Alekh Agarwal (researcher profile) - https://www.microsoft.com/en-us/research/people/alekha/
■ John Langford (researcher profile) - https://www.microsoft.com/en-us/research/people/jcl/

*This on-demand webinar features a previously recorded Q&A session and open captioning.

This webinar originally aired on December 5, 2019

Explore more Microsoft Research webinars: https://aka.ms/msrwebinars

Other Videos By Microsoft Research

2021-06-09	Controllable Human Motion Generation from Trajectories \| JRC Workshop 2021
2021-06-09	Towards Markerless Surgical Tool and Hand Pose Estimation \| JRC Workshop 2021
2021-06-09	Project Altair: Infrared Vision and AI-Decision Making for Longer Drone Flights
2021-06-09	Digital Characters in Virtual Experiences \| JRC Workshop 2021
2021-06-09	Reconstructing 3D Human with Learning-based Method \| JRC Workshop 2021
2021-06-09	Freetures: Localization in Signed Distance Function Maps \| JRC Workshop 2021
2021-06-03	Racist Tropes & Labor Discipline: How Tech Inherits & Reproduces Global Imaginaries of Race and Work
2021-06-02	Directions in ML: Latent Stochastic Differential Equations: An Unexplored Model Class
2021-05-27	Fuzzing to improve the security and reliability of cloud services with RESTler
2021-05-27	Pushing the frontier of neural text to speech
2021-05-27	Foundations of Real-World Reinforcement Learning
2021-05-27	Homomorphic Encryption with Microsoft SEAL
2021-05-27	Data Visualization: Bridging the Gap Between Users and Information
2021-05-26	Exploring Reinforcement Learning Methods from Algorithm to Application
2021-05-26	Microsoft Rocket: Hybrid Edge + Cloud Video Analytics Platform
2021-05-26	Harnessing high-fidelity simulation for autonomous systems through AirSim
2021-05-26	Microsoft ElectionGuard—enabling voters to verify that their votes are correctly counted
2021-05-26	Designing Computer Vision Algorithms to Describe the Visual World to People Who Are Blind/Low Vision
2021-05-26	The next generation of developer tools for data programming
2021-05-26	Expanding the possibilities of programming languages with Bosque
2021-05-26	Harnessing the problem-solving power of quantum computing

Tags:

Reinforcement learning

autonomous learning agent

autonomous agent

John Langford

Alekh Agarwal

Vowpal Wabbit

Machine Learning

Microsoft Research

Channel	Latest
domisumReplay: Renekton	6 hours ago
Mehmet Uzun	6 hours ago
domisumReplay: Syndra	6 hours ago
domisumReplay: Mordekaiser	6 hours ago
Shhoto	7 hours ago
DismArchus	7 hours ago
Zanginary	7 hours ago
Baba Behwish	7 hours ago
LegitKorea	7 hours ago
domisumReplay: Aatrox	7 hours ago
CamXPetra	7 hours ago
youRINK 🎶	7 hours ago
domisumReplay: Akali	7 hours ago
domisumReplay: Sett	7 hours ago
domisumReplay: Kayle	7 hours ago
iTownGamePlay Terror&Diversión	7 hours ago
David Voices	7 hours ago
Nickich	8 hours ago
Regiz	8 hours ago
PUBG MOBILE Esports MEA	8 hours ago
League of SUPPORT - LOL Replays	8 hours ago
Happy Animes Recaps	8 hours ago
HeroxHeroTV	8 hours ago
SiIvaGunner	8 hours ago
Oh Shiitake Mushrooms	8 hours ago