Towards ad hoc interactions with robots

Subscribers:
344,000
Published on ● Video Link: https://www.youtube.com/watch?v=F7Sqa8qypm8



Duration: 1:02:33
17 views
0


A primary motivation for work within my group is the notion of autonomous agents that can interact, robustly over the long term, with an incompletely known environment that continually changes. In this talk I will describe results from a few different projects that attempt to address key aspects of this big question. I will begin by looking at how task encodings can be made effective using qualitative (geometric) structure in the strategy space. Using examples that may be familiar to many machine learning researchers ΓÇô such as control of an inverted pendulum and bipedal walkers ΓÇô we will explore this connection between the geometric structure of solutions and strategies for dealing with a continually changing task context. The key result here would be regarding ways to combine exploitation of 'natural' dynamics with the benefits of active planning. Can there be similarly flexible encodings for more general decision problems, beyond the domain of robot control? I will describe recent results from our work on policy reuse and transfer learning, demonstrating how it is possible to construct agents that can learn to adapt, through a process of belief updating based on policy performance, to a changing task context including the case where the change may be induced by other decision making agents. Finally, building on this theme of making decisions in the presence of other decision making agents, I will briefly describe results from our recent experiments in human-robot interaction where agents must learn to influence the behaviour of other agents in order to achieve their task. This experiment is a step towards general and implementable models of ad hoc interaction where agents learn from experience to shape aspects of that interaction without the benefits of prior coordination and related knowledge. I will conclude with some remarks on the potential practical uses of such models and learning methods in a wide variety of applications ranging from personal robotics to intelligent user interfaces.




Other Videos By Microsoft Research


2016-08-11Stochastic Optimization and Sparse Statistical Recovery: An Optimal Algorithm for High Dimensions
2016-08-11Jim Grey Award & The Possibilities and Pitfalls Internet-Based Chemical Data
2016-08-11WP8 - Using native code in ISV apps
2016-08-11Music Intelligence & the 'Taste Profile' - What Computers Think of You and Your Music Taste
2016-08-11Machine Learning Course - Lecture 8
2016-08-11Panel: Scientific Data: the Current Landscape, Challenges, and Solutions
2016-08-11Machine Learning Work Shop - A Minimax Entropy Method for Learning the Wisdom of Crowds
2016-08-11Publishing and eScience Panel
2016-08-11Design-led Innovation
2016-08-11Future Perfect: The Case for Progress in A Networked Age
2016-08-11Towards ad hoc interactions with robots
2016-08-11Dynamically Enforcing Knowledge-based Security Policies
2016-08-11Real Applications of Non-Real Numbers
2016-08-11From the Information Extraction Pipeline to Global Models, and Back
2016-08-11Some Algorithmic Problems in High Dimensions
2016-08-11Machine Learning Course - Lecture 2
2016-08-11Panel: Open Data for Open Science - Data Interoperability
2016-08-11Cloud Computing - What Do Researchers Want? - A Panel Discussion
2016-08-11Machine Learning Work Shop - Recovery of Simultaneously Structured Models by Convex Optimization
2016-08-11Machine Learning Work Shop- A Proximal-Gradient Homotopy Method for the Sparse Least-Squares Problem
2016-08-11Machine Learning Work Shop - Combining Machine and Human Intelligence in Crowdsourcing



Tags:
microsoft research