[BERT] Pretrained Deep Bidirectional Transformers for Language Understanding (algorithm) | TDLS

Video Link: https://www.youtube.com/watch?v=BhlOGGzC0Q0



Duration: 53:07
83,060 views


Toronto Deep Learning Series
Host: Ada + @ML Explained - Aggregate Intellect - AI.SCIENCE
Date: Nov 6th, 2018

Aggregate Intellect is a Global Marketplace where ML Developers Connect, Collaborate, and Build.
-Connect with peers & experts at https://ai.science
-Join our Slack Community: https://join.slack.com/t/aisc-to/shared_invite/zt-f5zq5l35-PSIJTFk4v60FML177PgsPg
-Check out the user-generated Recipes that provide step-by-step, bite-sized guides on how to do various tasks: https://ai.science/recipes

For details including slides, visit https://aisc.ai.science/events/2018-11-06

Paper: https://arxiv.org/abs/1810.04805

Speaker: Danny Luo (Dessa)
https://dluo.me/

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

We introduce a new language representation model called BERT, which stands for Bidirectional Encoder Representations from Transformers. Unlike recent language representation models, BERT is designed to pre-train deep bidirectional representations by jointly conditioning on both left and right context in all layers. As a result, the pre-trained BERT representations can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks, such as question answering and language inference, without substantial task-specific architecture modifications.
BERT is conceptually simple and empirically powerful. It obtains new state-of-the-art results on eleven natural language processing tasks, including pushing the GLUE benchmark to 80.4% (7.6% absolute improvement), MultiNLI accuracy to 86.7% (5.6% absolute improvement) and the SQuAD v1.1 question answering Test F1 to 93.2 (1.5% absolute improvement), outperforming human performance by 2.0%.
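As a reading aid (not part of the talk description): the "jointly conditioning on both left and right context" above refers to BERT's masked language modeling pre-training objective. The paper masks roughly 15% of input tokens; of those, 80% become [MASK], 10% become a random token, and 10% are left unchanged. A minimal sketch of that corruption step in plain Python, with made-up tokens and vocabulary for illustration:

```python
import random

MASK, CLS, SEP = "[MASK]", "[CLS]", "[SEP]"

def mask_tokens(tokens, vocab, mask_prob=0.15, seed=0):
    """BERT-style masked-LM corruption: select ~mask_prob of positions;
    of those, 80% -> [MASK], 10% -> random vocab token, 10% unchanged.
    Returns (corrupted, labels); labels hold the original token at
    selected positions and None elsewhere (no loss computed there)."""
    rng = random.Random(seed)
    corrupted, labels = [], []
    for tok in tokens:
        # Special tokens are never masked; most tokens pass through.
        if tok in (CLS, SEP) or rng.random() > mask_prob:
            corrupted.append(tok)
            labels.append(None)
            continue
        labels.append(tok)  # the model must predict the original token
        r = rng.random()
        if r < 0.8:
            corrupted.append(MASK)            # 80%: replace with [MASK]
        elif r < 0.9:
            corrupted.append(rng.choice(vocab))  # 10%: random token
        else:
            corrupted.append(tok)             # 10%: keep unchanged
    return corrupted, labels

# Hypothetical example sentence and vocabulary.
toks = [CLS, "the", "man", "went", "to", "the", "store", SEP]
corrupted, labels = mask_tokens(toks, vocab=["dog", "ran", "blue"])
```

Because 10% of selected tokens are kept unchanged, the model cannot tell which positions are corrupted and must maintain a contextual representation of every token, which is what makes the learned encoder useful after fine-tuning.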




Other Videos By LLMs Explained - Aggregate Intellect - AI.SCIENCE


2018-12-16Automated Deep Learning: Joint Neural Architecture and Hyperparameter Search (algorithm) | AISC
2018-12-09Automated Vulnerability Detection in Source Code Using Deep Learning (discussions) | AISC
2018-12-09Automated Vulnerability Detection in Source Code Using Deep Learning (algorithm) | AISC
2018-12-05[DQN] Human-level control through deep reinforcement learning (discussions) | AISC Foundational
2018-12-05Deep Q-Learning paper explained: Human-level control through deep reinforcement learning (algorithm)
2018-12-03SMOTE, Synthetic Minority Over-sampling Technique (discussions) | AISC Foundational
2018-12-02TDLS - Classics: SMOTE, Synthetic Minority Over-sampling Technique (algorithm)
2018-11-30Visualizing Data using t-SNE (algorithm) | AISC Foundational
2018-11-30Visualizing Data using t-SNE (discussions) | AISC Foundational
2018-11-27[BERT] Pretrained Deep Bidirectional Transformers for Language Understanding (discussions) | TDLS
2018-11-27[BERT] Pretrained Deep Bidirectional Transformers for Language Understanding (algorithm) | TDLS
2018-11-27Neural Image Caption Generation with Visual Attention (algorithm) | AISC
2018-11-27Neural Image Caption Generation with Visual Attention (discussion) | AISC
2018-11-17PGGAN | Progressive Growing of GANs for Improved Quality, Stability, and Variation (part 2) | AISC
2018-11-16PGGAN | Progressive Growing of GANs for Improved Quality, Stability, and Variation (part 1) | AISC
2018-11-16(Original Paper) Latent Dirichlet Allocation (discussions) | AISC Foundational
2018-11-15(Original Paper) Latent Dirichlet Allocation (algorithm) | AISC Foundational
2018-10-31[Transformer] Attention Is All You Need | AISC Foundational
2018-10-25[Original attention] Neural Machine Translation by Jointly Learning to Align and Translate | AISC
2018-10-16[StackGAN++] Realistic Image Synthesis with Stacked Generative Adversarial Networks | AISC
2018-10-11Bayesian Deep Learning on a Quantum Computer | TDLS Author Speaking



Tags:
deep learning
machine learning
ai
artificial intelligence
natural language processing
nlp
bert
bert nlp
google bert
bert google
bert model
bert deep learning
bert language model
bert explained
bert transformer