Research talk: Focal Attention: Towards local-global interactions in vision transformers

Subscribers:
344,000
Video Link: https://www.youtube.com/watch?v=dhAz_hgoG1M



Duration: 7:40
432 views


Speaker: Jianwei Yang, Senior Researcher, Microsoft Research Redmond

At present, deep neural networks have become prevalent for building AI systems for vision, language, and multimodality. However, building efficient, task-oriented models remains a challenging problem for researchers. In these lightning talks, Senior RSDE Baolin Peng and Senior Researcher Jianwei Yang from the Deep Learning Group at Microsoft Research Redmond discuss end-to-end dialog systems and a new architecture for vision systems, respectively. For dialog systems, an end-to-end learning system is achieved through self-learning from conversations with a human in the loop. For vision systems, a sparse attention mechanism has been developed for the Vision Transformer to cope with high-resolution image inputs for image classification, object detection, and semantic segmentation.
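
The focal attention idea named in the talk title combines fine-grained attention over nearby tokens with coarse-grained attention over pooled summaries of more distant regions, which keeps attention sparse on high-resolution inputs. Below is a minimal PyTorch sketch of that local-global pattern; the module name LocalGlobalAttention, its parameters, and the dense (non-windowed) fine-grained path are illustrative assumptions, not the implementation presented in the talk.

import torch
import torch.nn as nn
import torch.nn.functional as F

class LocalGlobalAttention(nn.Module):
    # Illustrative sketch only: each query attends to fine-grained tokens
    # from the full-resolution feature map plus coarse-grained (pooled)
    # summaries that supply global context. A real focal attention layer
    # restricts the fine-grained keys/values to a local window around each
    # query; that windowing is omitted here for brevity.
    def __init__(self, dim, num_heads=4, pool_size=4):
        super().__init__()
        assert dim % num_heads == 0
        self.num_heads = num_heads
        self.head_dim = dim // num_heads
        self.pool_size = pool_size
        self.q = nn.Linear(dim, dim)
        self.kv = nn.Linear(dim, 2 * dim)
        self.proj = nn.Linear(dim, dim)

    def forward(self, x):
        # x: (B, H, W, C) feature map from a vision backbone
        B, H, W, C = x.shape
        q = self.q(x).reshape(B, H * W, self.num_heads, self.head_dim).transpose(1, 2)

        # Fine-grained tokens: full-resolution features.
        fine = x.reshape(B, H * W, C)

        # Coarse-grained tokens: average-pooled summaries of the whole map.
        coarse = F.adaptive_avg_pool2d(x.permute(0, 3, 1, 2), self.pool_size)
        coarse = coarse.flatten(2).transpose(1, 2)  # (B, pool_size*pool_size, C)

        # Keys/values cover both granularities; queries attend to both at once.
        k, v = self.kv(torch.cat([fine, coarse], dim=1)).chunk(2, dim=-1)
        k = k.reshape(B, -1, self.num_heads, self.head_dim).transpose(1, 2)
        v = v.reshape(B, -1, self.num_heads, self.head_dim).transpose(1, 2)

        attn = (q @ k.transpose(-2, -1)) / self.head_dim ** 0.5
        out = (attn.softmax(dim=-1) @ v).transpose(1, 2).reshape(B, H, W, C)
        return self.proj(out)

# Toy usage: a 14x14 feature map with 128 channels.
x = torch.randn(2, 14, 14, 128)
print(LocalGlobalAttention(dim=128)(x).shape)  # torch.Size([2, 14, 14, 128])

The key design choice illustrated here is that global context comes from a small number of pooled tokens rather than from all tokens, so the attention cost grows much more slowly with input resolution than full self-attention.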

Learn more about the 2021 Microsoft Research Summit: https://Aka.ms/researchsummit




Other Videos By Microsoft Research


2022-02-08 Tutorial: Best practices for prioritizing fairness in AI systems
2022-02-08 Demo: RAI Toolbox: An open-source framework for building responsible AI
2022-02-08 Opening remarks: Responsible AI
2022-02-08 Closing remarks: Deep Learning and Large Scale AI
2022-02-08 Roundtable discussion: Beyond language models: Knowledge, multiple modalities, and more
2022-02-08 Research talk: Closing the loop in natural language interfaces to relational databases
2022-02-08 Just Tech: Bringing CS, the social sciences, and communities together for societal resilience
2022-02-08 Research talk: WebQA: Multihop and multimodal
2022-02-08 Opening remarks: Tech for resilient communities
2022-02-08 Research talk: Towards Self-Learning End-to-end Dialog Systems
2022-02-08 Research talk: Focal Attention: Towards local-global interactions in vision transformers
2022-02-08 Research talk: Knowledgeable pre-trained language models
2022-02-08 Opening remarks: Deep Learning and Large-Scale AI
2022-02-08 Closing remarks: Cloud Intelligence/AIOps
2022-02-08 Research talk: Optimizing the cloud supply chain
2022-02-08 Research talk: Automating and Optimizing IT Operations Management with AI
2022-02-08 Research talk: An intelligent data-driven paradigm towards cloud reliability
2022-02-08 Talk: Multidimensional analysis of cloud-native software based on large-scale operation data
2022-02-08 Keynote: Cloud Intelligence: Infusing AI into cloud computing systems
2022-02-08 Opening remarks: Cloud Intelligence/AIOps
2022-02-08 Research talk: NUWA: Neural visual world creation with multimodal pretraining



Tags:
deep learning
large-scale models
large-scale AI models
AI
artificial intelligence
microsoft research summit