ERNIE 2.0: A Continual Pre-Training Framework for Language Understanding | AISC
For slides and more information on the paper, visit https://aisc.ai.science/events/3919-09-03
Discussion lead: Royal Sequeira
Motivation:
Recently, pre-trained models have achieved state-of-the-art results in various language understanding
tasks, which indicates that pre-training on large-scale corpora may play a crucial role in natural
language processing. Current pre-training procedures usually focus on training the model with several
simple tasks to grasp the co-occurrence of words or sentences. However, besides co-occurrence, there
is other valuable lexical, syntactic and semantic information in training corpora, such as named
entities, semantic closeness and discourse relations. In order to extract the lexical, syntactic and
semantic information from training corpora to the fullest extent, we propose a continual pre-training
framework named ERNIE 2.0, which incrementally builds pre-training tasks and learns them through
continual multi-task learning. Experimental results demonstrate that ERNIE 2.0 outperforms BERT and
XLNet on 16 tasks, including English tasks on the GLUE benchmark and several common tasks in Chinese.
The source code and pre-trained models have been released at https://github.com/PaddlePaddle/ERNIE.
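
The core idea behind the framework is that pre-training tasks are introduced one at a time, and each new task is then trained jointly with all previously introduced tasks so that earlier knowledge is retained. Below is a minimal Python/PyTorch sketch of that loop; it is not the released PaddlePaddle implementation, and the task names ("masked_entity", "sentence_distance"), the toy encoder, and the head sizes are illustrative assumptions.

# Minimal sketch of continual multi-task pre-training (assumptions noted above).
import torch
import torch.nn as nn

class SharedEncoder(nn.Module):
    """Stand-in for the shared Transformer encoder (here: a tiny embedding + MLP)."""
    def __init__(self, vocab_size=1000, hidden=128):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, hidden)
        self.ff = nn.Linear(hidden, hidden)

    def forward(self, token_ids):
        # (batch, seq_len) -> (batch, hidden) via mean pooling
        return torch.relu(self.ff(self.embed(token_ids))).mean(dim=1)

class ContinualPretrainer:
    """Grows a set of task heads over one shared encoder."""
    def __init__(self):
        self.encoder = SharedEncoder()
        self.heads = nn.ModuleDict()   # one output head per pre-training task
        self.losses = {}               # per-task loss functions

    def add_task(self, name, num_labels):
        # Incrementally build a new pre-training task (names are hypothetical).
        self.heads[name] = nn.Linear(128, num_labels)
        self.losses[name] = nn.CrossEntropyLoss()

    def parameters(self):
        return list(self.encoder.parameters()) + list(self.heads.parameters())

    def train_step(self, batches, optimizer):
        # Continual multi-task learning: each step sums losses over ALL tasks
        # introduced so far, not just the newest one.
        optimizer.zero_grad()
        total = 0.0
        for name, (token_ids, labels) in batches.items():
            features = self.encoder(token_ids)
            total = total + self.losses[name](self.heads[name](features), labels)
        total.backward()
        optimizer.step()
        return float(total)

# Usage sketch: add tasks one by one and keep training on the union of tasks.
trainer = ContinualPretrainer()
trainer.add_task("masked_entity", num_labels=2)        # stage 1: first task only
opt = torch.optim.Adam(trainer.parameters(), lr=1e-4)
batches = {"masked_entity": (torch.randint(0, 1000, (4, 16)),
                             torch.randint(0, 2, (4,)))}
trainer.train_step(batches, opt)

trainer.add_task("sentence_distance", num_labels=3)    # stage 2: new task + old task
opt = torch.optim.Adam(trainer.parameters(), lr=1e-4)  # re-create to cover new head
batches["sentence_distance"] = (torch.randint(0, 1000, (4, 16)),
                                torch.randint(0, 3, (4,)))
trainer.train_step(batches, opt)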