Vision-and-Dialog Navigation

Subscribers:
344,000
Published on ● Video Link: https://www.youtube.com/watch?v=XL3FMpceYoE



Category:
Vlog
Duration: 56:42
665 views
17


Dialog-enabled smart assistants, which communicate via natural language and occupy human homes, have seen widespread adoption in recent years. These systems can communicate information, but do not manipulate objects or move themselves. By contrast, manipulation-capable and mobile robots are still largely deployed in industrial settings, but do not interact with human users. Dialog-enabled robots can bridge this gap, with natural language interfaces helping robots and non-experts collaborate to achieve their goals. In particular, navigation in unseen or dynamic environments to high-level goals (e.g., "Go to the room with a plant") can be facilitated by enabling navigation agents to ask questions in language, and to react to human clarifications on-the-fly. To study this challenge, we introduce Cooperative Vision-and-Dialog Navigation, an English language dataset situated in the Matterport Room-2-Room simulation environment.

See more at https://www.microsoft.com/en-us/research/video/vision-and-dialog-navigation/




Other Videos By Microsoft Research


2019-10-14Improving Doctor-Patient Interaction with ML-Enabled Clinical Note Taking
2019-10-11HapSense: A Soft Haptic I/O Device with Uninterrupted Dual Functionalities...
2019-10-09Advanced polarized light microscopy for mapping molecular orientation
2019-10-09Data science and ML for human well-being with Jina Suh [Podcast]
2019-10-07Tea: A High-level Language and Runtime System for Automating Statistical Analysis [Python module]
2019-10-07Discover[i]: Component-based Parameterized Reasoning for Distributed Applications
2019-10-04Scheduling For Efficient Large-Scale Machine Learning Training
2019-10-03Distributed Entity Resolution for Computational Social Science
2019-10-03MMLSpark: empowering AI for Good with Mark Hamilton [Podcast]
2019-10-02Non-linear Invariants for Control-Command Systems
2019-10-02Vision-and-Dialog Navigation
2019-10-01The Future of Mathematics?
2019-09-30How Not to Prove Your Election Outcome
2019-09-30The Worst Form Including All Those Others: Canada’s Experiments with Online Voting
2019-09-30DIFF: A Relational Interface for Large-Scale Data Explanation
2019-09-30A Calculus for Brain Computation
2019-09-26Decoding Multisensory Attention from Electroencephalography for Use in a Brain-Computer Interface
2019-09-26A Short Introduction to DIMACS & DIMACS and MSR-NYC
2019-09-26Boosting Innovation and Discovery of Ideas
2019-09-26Resource-Efficient Redundancy for Large-Scale Data Processing and Storage Systems
2019-09-26Optimizing Declarative Graph Queries at Large Scale



Tags:
smart assistants
natural language
Dialog-enabled robots
navigation agents
Cooperative Vision-and-Dialog Navigation
Matterport Room-2-Room
AI
human-computer interaction
microsoft research
robotics