Vision to Language

Video Link: https://www.youtube.com/watch?v=0proEod5e3o



Duration: 1:19:42

Recent advances in computer vision, natural language processing, and related areas have led to renewed interest in artificial intelligence applications that span multiple domains. In particular, interest in generating natural, human-like captions for images has grown sharply. In this session, the speakers provide insight into this area. They describe several approaches that combine state-of-the-art computer vision techniques with language models to produce descriptions of visual content of surprisingly high quality. They also emphasize the limitations of current approaches and the challenges that lie ahead.







Tags:
microsoft research
artificial intelligence
deep neural networks