Recurrent Neural Network Writes Sentences About Images | Two Minute Papers #23

Subscribers:
1,720,000
Published on ● Video Link: https://www.youtube.com/watch?v=e-WB4lfg30M



Duration: 2:41
26,235 views
538


This technique is a combination of two powerful machine learning algorithms:
- convolutional neural networks are excellent at image classification, i.e., finding out what is seen on an input image,
- recurrent neural networks that are capable of processing a sequence of inputs and outputs, therefore it can create sentences of what is seen on the image.

Combining these two techniques makes it possible for a computer to describe in a sentence what is seen on an input image.

_____________________

The paper "Deep Visual-Semantic Alignments for Generating Image Descriptions" is available here:
http://cs.stanford.edu/people/karpathy/deepimagesent/

A gallery with more results with the same algorithm:
http://cs.stanford.edu/people/karpathy/deepimagesent/generationdemo/

You can train your own convolutional neural network here:
http://cs.stanford.edu/people/karpathy/convnetjs/demo/cifar10.html

The source code for the project is now available here:
https://github.com/karpathy/neuraltalk2

Subscribe if you would like to see more of these! - http://www.youtube.com/subscription_center?add_user=keeroyz

The thumbnail image background was made by Georgie Pauwels (CC BY 2.0) - https://flic.kr/p/qrRciQ
Splash screen/thumbnail design: Felícia Fehér - http://felicia.hu

Károly Zsolnai-Fehér's links:
Patreon → https://www.patreon.com/TwoMinutePapers
Facebook → https://www.facebook.com/TwoMinutePapers/
Twitter → https://twitter.com/karoly_zsolnai
Web → https://cg.tuwien.ac.at/~zsolnai/




Other Videos By Two Minute Papers


2015-12-12OpenAI - Non-profit AI company by Elon Musk and Sam Altman
2015-12-10Randomness and Bell's Inequality [Audio only] | Two Minute Papers #31
2015-12-03Automatic Parameter Control for Metropolis Light Transport | Two Minute Papers #30
2015-11-29Artificial Superintelligence [Audio only] | Two Minute Papers #29
2015-11-25Are We Living In a Computer Simulation? | Two Minute Papers #28
2015-11-22Google DeepMind's Deep Q-Learning & Superhuman Atari Gameplays | Two Minute Papers #27
2015-11-21Multiple-Scattering Microfacet BSDFs with the Smith Model
2015-11-18Terrain Traversal with Reinforcement Learning | Two Minute Papers #26
2015-11-15Cryptography, Perfect Secrecy and One Time Pads | Two Minute Papers #25
2015-11-11How Does Deep Learning Work? | Two Minute Papers #24
2015-11-07Recurrent Neural Network Writes Sentences About Images | Two Minute Papers #23
2015-11-06Be a Part of Two Minute Papers on Patreon!
2015-11-03Automatic Lecture Notes From Videos | Two Minute Papers #22
2015-10-30Real-Time Facial Expression Transfer | Two Minute Papers #21
2015-10-26Gradients, Poisson's Equation and Light Transport | Two Minute Papers #20
2015-10-23Recurrent Neural Network Writes Music and Shakespeare Novels | Two Minute Papers #19
2015-10-20Modeling Colliding and Merging Fluids | Two Minute Papers #18
2015-10-173D Printing a Glockenspiel | Two Minute Papers #17
2015-10-14Metropolis Light Transport | Two Minute Papers #16
2015-10-11Synthesizing Sound From Collisions | Two Minute Papers #15
2015-10-06Adaptive Cloth Simulations | Two Minute Papers #14



Tags:
two minute papers
Deep Visual-Semantic Alignments for Generating Image Descriptions
neural network
artificial intelligence
image captioning
image classification
Recurrent Neural Network
Artificial Neural Network
convolutional neural network
deep learning
long short-term memory
long short term memory
lstm
recurrent neural network
neuraltalk2
image recognition
neural network image recognition
neural network caption