Neural Image Caption Generation with Visual Attention (discussion) | AISC

Published on ● Video Link: https://www.youtube.com/watch?v=u_Mdp_3RVRA



Category:
Discussion
Duration: 18:38
996 views
10


Toronto Deep Learning Series, 12 November 2018

Paper: http://proceedings.mlr.press/v37/xuc15.pdf

Speaker: Waseem Gharbieh (Twenty Billion Neurons GmbH)

Host: Rangle.io
Date: Nov 12th, 2018

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

Inspired by recent work in machine translation and object detection, we introduce an attention based model that automatically learns to describe the content of images. We describe how we can train this model in a deterministic manner using standard backpropagation techniques and stochastically by maximizing a variational lower bound. We also show through visualization how the model is able to automatically learn to fix its gaze on salient objects while generating the corresponding words in the output sequence. We validate the use of attention with state-of-the art performance on three benchmark datasets: Flickr9k, Flickr30k and MS COCO




Other Videos By LLMs Explained - Aggregate Intellect - AI.SCIENCE


2018-12-09Automated Vulnerability Detection in Source Code Using Deep Learning (algorithm) | AISC
2018-12-05[DQN] Human-level control through deep reinforcement learning (discussions) | AISC Foundational
2018-12-05Deep Q-Learning paper explained: Human-level control through deep reinforcement learning (algorithm)
2018-12-03SMOTE, Synthetic Minority Over-sampling Technique (discussions) | AISC Foundational
2018-12-02TDLS - Classics: SMOTE, Synthetic Minority Over-sampling Technique (algorithm)
2018-11-30Visualizing Data using t-SNE (algorithm) | AISC Foundational
2018-11-30Visualizing Data using t-SNE (discussions) | AISC Foundational
2018-11-27[BERT] Pretranied Deep Bidirectional Transformers for Language Understanding (discussions) | TDLS
2018-11-27[BERT] Pretranied Deep Bidirectional Transformers for Language Understanding (algorithm) | TDLS
2018-11-27Neural Image Caption Generation with Visual Attention (algorithm) | AISC
2018-11-27Neural Image Caption Generation with Visual Attention (discussion) | AISC
2018-11-17PGGAN | Progressive Growing of GANs for Improved Quality, Stability, and Variation (part 2) | AISC
2018-11-16PGGAN | Progressive Growing of GANs for Improved Quality, Stability, and Variation (part 1) | AISC
2018-11-16(Original Paper) Latent Dirichlet Allocation (discussions) | AISC Foundational
2018-11-15(Original Paper) Latent Dirichlet Allocation (algorithm) | AISC Foundational
2018-10-31[Transformer] Attention Is All You Need | AISC Foundational
2018-10-25[Original attention] Neural Machine Translation by Jointly Learning to Align and Translate | AISC
2018-10-16[StackGAN++] Realistic Image Synthesis with Stacked Generative Adversarial Networks | AISC
2018-10-11Bayesian Deep Learning on a Quantum Computer | TDLS Author Speaking
2018-10-02Prediction of Cardiac arrest from physiological signals in the pediatric ICU | TDLS Author Speaking
2018-09-24Junction Tree Variational Autoencoder for Molecular Graph Generation | TDLS



Tags:
machine vision
deep learning
machine learning
ai
toronto
artificial intelligence
canada
visiual attention
image captioning
image captioning deep learning
attention model