Exploring Reinforcement Learning Methods from Algorithm to Application

Channel: Microsoft Research

Subscribers: 351,000

Published on May 27, 2021 3:17:39 AM ● Video Link: https://www.youtube.com/watch?v=LsztMquHGDg

Duration: 1:30:48

2,952 views

Reinforcement learning (RL) is a systematic approach to learning and decision making under uncertainty. RL has been developed and studied for decades, and recent combinations of RL with modern deep learning have produced impressive demonstrations of what today's RL systems can do. These new combinations have fueled an explosion of interest and research activity.

In this webinar led by Dr. Katja Hofmann, a Principal Researcher in the Game Intelligence group at Microsoft Research Cambridge, learn about the foundations of RL: elegant ideas that give rise to agents able to learn extremely complex behaviors in a wide range of settings. You will also gain an overview, from a researcher's perspective, of what is currently possible in RL. The webinar concludes with an outlook on key opportunities, both for future research and for real-world applications of RL.

Together, you'll explore:

■ Why a Markov Decision Process is a simple yet powerful abstraction for reinforcement learning problems
■ How to model a task as a reinforcement learning problem
■ The challenge of balancing exploration and exploitation in reinforcement learning
■ One of the fundamental approaches to reinforcement learning problems, Q-learning, and how it solves the credit assignment problem
■ Q-learning with function approximation
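
As a rough illustration of the topics listed above (this example is not taken from the webinar), here is a minimal sketch of tabular Q-learning with epsilon-greedy exploration. It assumes a hypothetical toy MDP: a short corridor in which the agent starts in the leftmost cell and receives a reward of 1 only on reaching the rightmost cell. The environment, the function names, and the hyperparameters (alpha, gamma, epsilon) are all assumptions chosen for illustration.

```python
import random

# Hypothetical toy MDP: a corridor of N_STATES cells. The agent starts in
# cell 0; the episode ends with reward 1 when it reaches the last cell.
N_STATES = 6
ACTIONS = (0, 1)  # 0 = move left, 1 = move right


def step(state, action):
    """Deterministic transition: return (next_state, reward, done)."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def epsilon_greedy(q, state, epsilon, rng):
    """Explore with probability epsilon, otherwise exploit (random tie-break)."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    best = max(q[state])
    return rng.choice([a for a in ACTIONS if q[state][a] == best])


def q_learning(episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning; returns a table q[state][action]."""
    rng = random.Random(seed)
    q = [[0.0 for _ in ACTIONS] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            action = epsilon_greedy(q, state, epsilon, rng)
            next_state, reward, done = step(state, action)
            # Bootstrapped target: the reward is propagated backwards through
            # max_a' Q(s', a'), which is how credit reaches earlier states.
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


q_table = q_learning()
# Greedy action in each non-terminal state after training.
greedy_policy = [max(ACTIONS, key=lambda a: q_table[s][a])
                 for s in range(N_STATES - 1)]
print(greedy_policy)
```

In this sketch the table `q` holds one value per state-action pair. Q-learning with function approximation would replace that table with a parameterized model, such as a neural network, that is trained toward the same bootstrapped target.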

Resource list:

■ Game Intelligence (research group) - https://www.microsoft.com/en-us/research/group/deep-reinforcement-learning
■ Reinforcement Learning (research group) - https://www.microsoft.com/en-us/research/theme/reinforcement-learning-group
■ Project Malmo (project page) - https://www.microsoft.com/en-us/research/project/project-malmo
■ Optimistic Actor Critic avoids the pitfalls of greedy exploration in reinforcement learning (blog) - https://www.microsoft.com/en-us/research/blog/optimistic-actor-critic-avoids-the-pitfalls-of-greedy-exploration-in-reinforcement-learning
■ Malmo, Minecraft and machine learning with Dr. Katja Hofmann (podcast) - https://www.microsoft.com/en-us/research/podcast/malmo-minecraft-and-machine-learning-with-dr-katja-hofmann
■ Project Malmo competition returns with student organizers and a new mission: To democratize reinforcement learning (blog) - https://www.microsoft.com/en-us/research/blog/project-malmo-competition-returns-with-student-organizers-and-a-new-mission-to-democratize-reinforcement-learning
■ Reinforcement Learning: Past, Present, and Future Perspectives (publication) - https://www.microsoft.com/en-us/research/publication/reinforcement-learning-past-present-and-future-perspectives

■ Learn about advanced topics in Reinforcement Learning: https://aka.ms/neurips-2019-rl-tutorial
■ Get started with the Malmo platform: https://github.com/Microsoft/malmo
■ Results of the MineRL competition 2019 @NeurIPS: https://minerl.io/competition/

■ Katja Hofmann (researcher profile) - https://www.microsoft.com/en-us/research/people/kahofman

*This on-demand webinar features a previously recorded Q&A session and open captioning.

This webinar originally aired on January 15, 2020.

Explore more Microsoft Research webinars: https://aka.ms/msrwebinars

