Foundations of Real-World Reinforcement Learning

Subscribers:
344,000
Published on ● Video Link: https://www.youtube.com/watch?v=A6wNJ4-MpIg



Duration: 1:23:22
3,563 views
85


Reinforcement learning (RL) is an approach to sequential decision making under uncertainty which formalizes the principles for designing an autonomous learning agent. The broad goal of a reinforcement learning agent is to find an optimal policy which maximizes its long-term rewards over time. Its list of applications is growing as the technology advances and continues to be further integrated into many areas, such as education, health, advertising, autonomous systems, and gaming.

By starting from the perspective of an agent which interacts with and affects its environment, RL provides an improvement upon supervised learning in situations requiring decisions, and not just predictions. In particular, it motivates exploratory actions to discover novel rewarding behavior in the environment, a hallmark of intelligent agents.

In this webinar—led by Microsoft Researchers John Langford, Partner Research Manager with over a decade of experience in reinforcement learning-related research, and Alekh Agarwal, Principal Research Manager and leader of the Reinforcement Learning group in Redmond—learn how RL works to impact real-world problems across a variety of domains.

Together, you'll explore:

■ The definition and uses of RL, from a general paradigm to its broad range of applications
■ The various benefits of using RL as well as its current challenges
■ The specific types of RL—contextual bandits, imitation learning, and strategic exploration
■ Where these cutting-edge methods might take the future of RL.

𝗥𝗲𝘀𝗼𝘂𝗿𝗰𝗲 𝗹𝗶𝘀𝘁:

■ Reinforcement learning for the real world with Dr. John Langford and Rafah Hosn (Podcast) - https://www.microsoft.com/en-us/research/podcast/reinforcement-learning-for-the-real-world-with-dr-john-langford-and-rafah-hosn/
■ Real World Reinforcement Learning (Project page) - https://www.microsoft.com/en-us/research/project/real-world-reinforcement-learning/
■ Kinematic State Abstraction and Provably Efficient Rich-Observation Reinforcement Learning (Publication) - https://www.microsoft.com/en-us/research/publication/kinematic-state-abstraction-and-provably-efficient-rich-observation-reinforcement-learning/
■ Provably efficient reinforcement learning with rich observations (Blog) - https://www.microsoft.com/en-us/research/blog/provably-efficient-reinforcement-learning-with-rich-observations/
■ ICML 2017 Tutorial on Real World Interactive Learning (Tutorial) - https://hunch.net/~rwil/
■ Machine Learning (Theory) (John Langford’s blog) - https://hunch.net/
■ Vowpal Wabbit (open source project) - https://vowpalwabbit.org/
■ Reinforcement Learning (Career opportunities) - https://www.microsoft.com/en-us/research/theme/reinforcement-learning-group/#!opportunities
■ Alekh Agarwal (researcher profile) - https://www.microsoft.com/en-us/research/people/alekha/
■ John Langford (researcher profile) - https://www.microsoft.com/en-us/research/people/jcl/

*This on-demand webinar features a previously recorded Q&A session and open captioning.

This webinar originally aired on December 5, 2019

Explore more Microsoft Research webinars: https://aka.ms/msrwebinars




Other Videos By Microsoft Research


2021-06-09Controllable Human Motion Generation from Trajectories | JRC Workshop 2021
2021-06-09Towards Markerless Surgical Tool and Hand Pose Estimation | JRC Workshop 2021
2021-06-09Project Altair: Infrared Vision and AI-Decision Making for Longer Drone Flights
2021-06-09Digital Characters in Virtual Experiences | JRC Workshop 2021
2021-06-09Reconstructing 3D Human with Learning-based Method | JRC Workshop 2021
2021-06-09Freetures: Localization in Signed Distance Function Maps | JRC Workshop 2021
2021-06-03Racist Tropes & Labor Discipline: How Tech Inherits & Reproduces Global Imaginaries of Race and Work
2021-06-02Directions in ML: Latent Stochastic Differential Equations: An Unexplored Model Class
2021-05-27Fuzzing to improve the security and reliability of cloud services with RESTler
2021-05-27Pushing the frontier of neural text to speech
2021-05-27Foundations of Real-World Reinforcement Learning
2021-05-27Homomorphic Encryption with Microsoft SEAL
2021-05-27Data Visualization: Bridging the Gap Between Users and Information
2021-05-26Exploring Reinforcement Learning Methods from Algorithm to Application
2021-05-26Microsoft Rocket: Hybrid Edge + Cloud Video Analytics Platform
2021-05-26Harnessing high-fidelity simulation for autonomous systems through AirSim
2021-05-26Microsoft ElectionGuard—enabling voters to verify that their votes are correctly counted
2021-05-26Designing Computer Vision Algorithms to Describe the Visual World to People Who Are Blind/Low Vision
2021-05-26The next generation of developer tools for data programming
2021-05-26Expanding the possibilities of programming languages with Bosque
2021-05-26Harnessing the problem-solving power of quantum computing



Tags:
Reinforcement learning
autonomous learning agent
autonomous agent
John Langford
Alekh Agarwal
Vowpal Wabbit
Machine Learning
Microsoft Research