MSR Distinguished Lecture Series: First-person Perception and Interaction

Subscribers:
344,000
Published on ● Video Link: https://www.youtube.com/watch?v=e3FThz9jjOI



Duration: 1:09:24
701 views
29


Computer vision has seen major success in learning to recognize objects from massive “disembodied” Web photo collections labeled by human annotators. Yet cognitive science tells us that perception develops in the context of acting the world---and without intensive supervision. Meanwhile, many realistic vision tasks require not only categorizing a well-composed human-taken photo, but also actively deciding where to look in the first place. In the context of these challenges, we are exploring how machine perception benefits from anticipating the sights and sounds an agent will experience as a function of its own actions. Based on this premise, we introduce methods for learning to look around intelligently in novel environments, learning from video how to interact with objects, and perceiving audio-visual streams for both semantic and spatial context. Together, these are steps towards first-person perception, where interaction with the world is itself a supervisory signal.

See more at https://www.microsoft.com/en-us/research/video/first-person-perception-and-interaction/




Other Videos By Microsoft Research


2020-06-10Transparency and Intelligibility Throughout the Machine Learning Life Cycle
2020-06-10Machine Learning and Fairness Webinar
2020-06-10Consumer Brain-Computer Interfaces: From Science Fiction to Reality
2020-06-10Highly Conductive Flexible Sensor Integrated With Personal Devices For Practical Bio-Signal Measure
2020-06-08Microsoft Build 2020: Kevin Scott keynote with Lila Tretikov
2020-06-03Harvesting randomness, HAIbrid algorithms and safe AI with Dr. Siddhartha Sen | Podcast
2020-06-03Provably efficient reinforcement learning with Dr. Akshay Krishnamurthy | Podcast
2020-06-01What ‘bhasha’ do you want to talk in? With Kalika Bali and Dr. Monojit Choudhury | Podcast
2020-05-26Explaining Decisions from Vision Models and Correcting them via Human Feedback
2020-05-26Auditing Outsourced Services
2020-05-26MSR Distinguished Lecture Series: First-person Perception and Interaction
2020-05-26Large-scale live video analytics over 5G multi-hop camera networks
2020-05-26Kristin Lauter's TED Talk on Private AI at Congreso Futuro during Panel 11 / SOLVE
2020-05-19How an AI agent can balance a pole using a simulation
2020-05-19How to build Intelligent control systems using new tools from Microsoft and simulations by Mathworks
2020-05-13Diving into Deep InfoMax with Dr. Devon Hjelm | Podcast
2020-05-08An Introduction to Graph Neural Networks: Models and Applications
2020-05-07MSR Cambridge Lecture Series: Photonic-chip-based soliton microcombs
2020-05-07Multi-level Optimization Approaches to Computer Vision
2020-05-05How good is your classifier? Revisiting the role of evaluation metrics in machine learning
2020-05-05Fast and Flexible Multi-Task Classification Using Conditional Neural Adaptive Processes



Tags:
Computer vision
cognitive science
machine perception
semantic and spatial context
Eric Horvitz
Kristen Grauman
AI
Microsoft Research AI lab