Visual Understanding in Natural Language

Subscribers: 344,000
Video Link: https://www.youtube.com/watch?v=LAWeOZdvRvE
Duration: 1:20:30
Views: 1,329
Bridging visual and natural language understanding is a fundamental requirement for intelligent agents. This talk will focus mainly on automatic image captioning and visual question answering (VQA). I will cover some recent advances in automatic image caption evaluation, visual attention modeling, and generalization to images 'in the wild'. I will also introduce my recent work on vision-and-language navigation (VLN), in which we situate agents in a new reinforcement learning (RL) environment constructed from dense RGB-D imagery of 90 real buildings.
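
To make the visual attention modeling mentioned above a bit more concrete, here is a minimal, illustrative sketch of one additive (Bahdanau-style) soft-attention step over image region features. It is not the specific model presented in the talk; the function name, parameter shapes, and the 36-region toy setup are assumptions for illustration only.

    import numpy as np

    def soft_visual_attention(region_feats, query, W_v, W_q, w_a):
        """One additive soft-attention step over K image region features.

        region_feats: (K, D) features for K image regions (e.g. CNN grid cells)
        query:        (H,)   decoder state from the captioning / VQA model
        W_v, W_q, w_a: learned projections (hypothetical shapes for this sketch)
        """
        # Project regions and query into a shared space and score each region.
        scores = np.tanh(region_feats @ W_v + query @ W_q) @ w_a   # (K,)
        # Softmax the scores into attention weights over regions.
        weights = np.exp(scores - scores.max())
        weights /= weights.sum()
        # Context vector: attention-weighted sum of region features.
        context = weights @ region_feats                            # (D,)
        return weights, context

    # Toy usage: 36 regions, 2048-d features, 512-d decoder state, 256-d attention space.
    rng = np.random.default_rng(0)
    K, D, H, A = 36, 2048, 512, 256
    feats = rng.standard_normal((K, D))
    hidden = rng.standard_normal(H)
    W_v, W_q, w_a = (rng.standard_normal(s) * 0.01 for s in [(D, A), (H, A), (A,)])
    alpha, ctx = soft_visual_attention(feats, hidden, W_v, W_q, w_a)
    print(alpha.shape, ctx.shape)   # (36,) (2048,)

In a captioning or VQA decoder, weights like these are typically recomputed at every output step, so the model can "look at" different image regions as it generates each word.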

See more at https://www.microsoft.com/en-us/research/video/visual-understanding-in-natural-language/

Tags:
microsoft research
visual understanding
natural language
intelligent agents
automatic image captioning
visual question answering
VQA
VLN