Exploring Reinforcement Learning Methods from Algorithm to Application

Subscribers:
344,000
Published on ● Video Link: https://www.youtube.com/watch?v=LsztMquHGDg



Duration: 1:30:48
2,952 views
79


Reinforcement learning (RL) is a systematic approach to learning and decision making under uncertainty. Developed and studied for decades, recent combinations of RL with modern deep learning have led to impressive demonstrations of the capabilities of today's RL systems, and these new combinations have fueled an explosion of interest and research activity.

In this webinar led by Microsoft researcher Dr. Katja Hofmann, a Principal Researcher in the Game Intelligence group at Microsoft Research Cambridge, learn about the foundations of RL—elegant ideas giving rise to agents that can learn extremely complex behaviors in a wide range of settings. In the broader perspective, gain an overview of where we currently stand in terms of what is possible in RL from the researcher's perspective. The webinar concludes with an outlook on key opportunities—both for future research and real-world applications of RL.

Together, you'll explore:

■ Why a Markov Decision Process is a simple yet powerful abstraction for reinforcement learning problems
■ How to model a task as a reinforcement learning problem
■ The challenge of balancing exploration and exploitation in reinforcement learning
■ One of the fundamental approaches to reinforcement learning problems, Q-Learning, and how it solves the credit assignment problem
■ Q-learning with function approximation

𝗥𝗲𝘀𝗼𝘂𝗿𝗰𝗲 𝗹𝗶𝘀𝘁:

■ Game Intelligence (research group) - https://www.microsoft.com/en-us/research/group/deep-reinforcement-learning
■ Reinforcement Learning (research group) - https://www.microsoft.com/en-us/research/theme/reinforcement-learning-group
■ Project Malmo (project page) - https://www.microsoft.com/en-us/research/project/project-malmo
■ Optimistic Actor Critic avoids the pitfalls of greedy exploration in reinforcement learning(blog) - https://www.microsoft.com/en-us/research/blog/optimistic-actor-critic-avoids-the-pitfalls-of-greedy-exploration-in-reinforcement-learning
■ Malmo, Minecraft and machine learning with Dr. Katja Hofmann (Podcast) - https://www.microsoft.com/en-us/research/podcast/malmo-minecraft-and-machine-learning-with-dr-katja-hofmann
■ Project Malmo competition returns with student organizers and a new mission: To democratize reinforcement learning (blog) - https://www.microsoft.com/en-us/research/blog/project-malmo-competition-returns-with-student-organizers-and-a-new-mission-to-democratize-reinforcement-learning
■ Reinforcement Learning: Past, Present, and Future Perspectives (publication) - https://www.microsoft.com/en-us/research/publication/reinforcement-learning-past-present-and-future-perspectives

■ Learn about advanced topics in Reinforcement Learning: https://aka.ms/neurips-2019-rl-tutorial
■ Get started with the Malmo platform: https://github.com/Microsoft/malmo
■ Results of the MineRL competition 2019 @NeurIPS: https://minerl.io/competition/

■ Katja Hofmann (researcher profile) - https://www.microsoft.com/en-us/research/people/kahofman

*This on-demand webinar features a previously recorded Q&A session and open captioning.

This webinar originally aired on January 15, 2020

Explore more Microsoft Research webinars: https://aka.ms/msrwebinars




Other Videos By Microsoft Research


2021-06-09Digital Characters in Virtual Experiences | JRC Workshop 2021
2021-06-09Reconstructing 3D Human with Learning-based Method | JRC Workshop 2021
2021-06-09Freetures: Localization in Signed Distance Function Maps | JRC Workshop 2021
2021-06-03Racist Tropes & Labor Discipline: How Tech Inherits & Reproduces Global Imaginaries of Race and Work
2021-06-02Directions in ML: Latent Stochastic Differential Equations: An Unexplored Model Class
2021-05-27Fuzzing to improve the security and reliability of cloud services with RESTler
2021-05-27Pushing the frontier of neural text to speech
2021-05-27Foundations of Real-World Reinforcement Learning
2021-05-27Homomorphic Encryption with Microsoft SEAL
2021-05-27Data Visualization: Bridging the Gap Between Users and Information
2021-05-26Exploring Reinforcement Learning Methods from Algorithm to Application
2021-05-26Microsoft Rocket: Hybrid Edge + Cloud Video Analytics Platform
2021-05-26Harnessing high-fidelity simulation for autonomous systems through AirSim
2021-05-26Microsoft ElectionGuard—enabling voters to verify that their votes are correctly counted
2021-05-26Designing Computer Vision Algorithms to Describe the Visual World to People Who Are Blind/Low Vision
2021-05-26The next generation of developer tools for data programming
2021-05-26Expanding the possibilities of programming languages with Bosque
2021-05-26Harnessing the problem-solving power of quantum computing
2021-05-25Introducing Developer Velocity Lab to improve developers’ work and well-being
2021-05-24Machine Learning and Fairness
2021-05-24Post-quantum cryptography: Supersingular isogenies for beginners