Research talk: Successor feature sets: Generalizing successor representations across policies

Subscribers:
344,000
Published on ● Video Link: https://www.youtube.com/watch?v=Gfv26bD3Mlo



Duration: 9:33
419 views
0


Speaker: Kianté Brantley, PhD Student, University of Maryland

Successor-style representations have many advantages for reinforcement learning. For example, they can help an agent generalize from experience to new goals. However, successor-style representations are not optimized to generalize across policies—typically, a limited-length list of policies is maintained and information shared among them by representation learning or generalized policy iteration. Join University of Maryland PhD candidate Kianté Brantley to address these limitations in successor-style representations. With collaborators from Microsoft Research Montréal, he developed a new general successor-style representation, which brings together ideas from predictive state representations, belief space value iteration, and convex analysis. The new representation is highly expressive. For example, it allows for efficiently reading off an optimal policy for a new reward function or a policy that imitates a demonstration. Together, you’ll explore the basics of successor-style representation, the challenges of current approaches, and results of the proposed approach on small, known environments.

Learn more about the 2021 Microsoft Research Summit: https://Aka.ms/researchsummit




Other Videos By Microsoft Research


2022-02-08Plenary: Industrial Research in the 21st Century
2022-02-02Dead-end Discovery: How offline reinforcement learning could assist healthcare decision-makers
2022-01-27Microsoft Soundscape - overview of Routes feature
2022-01-24Research talk: Breaking the deadly triad with a target network
2022-01-24Research talk: Learning and pretraining strategies for dense retrieval in search and beyond
2022-01-24Opening remarks: Research for Industry
2022-01-24Opening remarks: Future of Cloud Networking
2022-01-24Research talk: Cloud networking for a post-Moore’s Law era
2022-01-24Industry talk: Key forces driving industry transformation and disruption
2022-01-24Research talk: Local factor models for large-scale inductive recommendation
2022-01-24Research talk: Successor feature sets: Generalizing successor representations across policies
2022-01-24Research talk: SPTAG++: Fast hundreds of billions-scale vector search with millisecond response time
2022-01-24Demo: Using network machine learning for organizational analytics
2022-01-24Opening remarks: New Future of Work
2022-01-24Research talk: Approximate nearest neighbor search systems at scale
2022-01-24Closing remarks: The Future of Search and Recommendation
2022-01-24Research talk: Semantic search science: How Microsoft Bing AI is powering Azure Cognitive Search
2022-01-24Research talk: Extracting pragmatics from content interactions to improve enterprise recommendations
2022-01-24Practical tips for productivity & wellbeing: Microproductivity strategy to do work in short bursts
2022-01-24Research talk: Summarizing information across multiple documents and modalities
2022-01-24Research talk: System frontiers for dense retrieval



Tags:
reward-based learning
reinforcement learning
innovation in artificial environments
accelerate AI
microsoft research summit