Consistency by Agreement in Zero-shot Neural Machine Translation | AISC

Published on 2019-08-28 ● Video Link: https://www.youtube.com/watch?v=2vR06ih4010



Duration: 1:02:43


For slides and more information on the paper, visit https://aisc.ai.science/events/3919-08-28

Discussion lead: Maruan Al-Shedivat


Motivation:
Generalization and reliability of multilingual translation often depend heavily on the amount of available parallel data for each language pair of interest. In this paper, we focus on zero-shot generalization, a challenging setup that tests models on translation directions they have not been optimized for at training time. To solve the problem, we (i) reformulate multilingual translation as probabilistic inference, (ii) define the notion of zero-shot consistency and show why standard training often results in models unsuitable for zero-shot tasks, and (iii) introduce a consistent agreement-based training method that encourages the model to produce equivalent translations of parallel sentences in auxiliary languages. We test our multilingual NMT models on multiple public zero-shot translation benchmarks (IWSLT17, UN corpus, Europarl) and show that agreement-based learning often yields a 2-3 BLEU improvement on zero-shot directions over strong baselines, without any loss in performance on supervised translation directions.
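
The sketch below is only a rough, hypothetical reading of the agreement idea described in the abstract, not the authors' code. The model interface (model.sample and model.log_prob) is assumed for illustration: sample(src, tgt_lang) decodes a translation of src into tgt_lang, and log_prob(src, tgt) scores a candidate translation tgt given src under the multilingual model. The loss encourages the translations of a parallel pair (x in L1, y in L2) into an auxiliary language L3 to agree.

    # Hedged sketch of an agreement-style loss for zero-shot NMT, based only on
    # the abstract above. The model interface (sample / log_prob) is hypothetical.

    def agreement_loss(model, x_l1, y_l2, aux_lang):
        """Penalize disagreement between translations of a parallel pair
        (x in language L1, y in language L2) into an auxiliary language L3."""
        # Decode each side of the parallel pair into the auxiliary language.
        z_from_x = model.sample(src=x_l1, tgt_lang=aux_lang)  # candidate z given x
        z_from_y = model.sample(src=y_l2, tgt_lang=aux_lang)  # candidate z given y

        # Score each candidate under the model conditioned on the *other* source;
        # maximizing these cross log-likelihoods pushes the two conditional
        # distributions over the auxiliary language toward agreement.
        cross_ll = (model.log_prob(src=y_l2, tgt=z_from_x)
                    + model.log_prob(src=x_l1, tgt=z_from_y))
        return -cross_ll

In training, such an agreement term would be added to the usual supervised translation loss, e.g. total_loss = supervised_loss + lambda_agree * agreement_loss(model, x_l1, y_l2, aux_lang), so that supervised directions are unaffected while zero-shot directions become consistent.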




Other Videos By LLMs Explained - Aggregate Intellect - AI.SCIENCE


2019-09-24  Lookahead Optimizer: k steps forward, 1 step back
2019-09-24  Similarity of neural network representations revisited
2019-09-23  Detecting Customer Complaint Escalation w/ Recurrent Neural Networks & Manually-Engineered Features
2019-09-23  Graph Normalizing Flows
2019-09-23  CNN Architectures for Large-Scale Audio Classification | AISC
2019-09-22  2019 AI Squared Forum Paper Track | AISC
2019-09-16  Making of a conversational agent platform | AISC
2019-09-09  A Survey of Singular Learning | AISC
2019-09-04  Overview of Reinforcement Learning | AISC
2019-09-03  Ernie 2.0: A Continual Pre-Training Framework for Language Understanding | AISC
2019-08-28  Consistency by Agreement in Zero-shot Neural Machine Translation | AISC
2019-08-26  TensorFuzz: Debugging Neural Networks with Coverage-Guided Fuzzing | AISC
2019-08-21  Science of science: Identifying Fundamental Drivers of Science | AISC
2019-08-19  AI Product Stream Meet and Greet | AISC
2019-08-12  [Original ResNet paper] Deep Residual Learning for Image Recognition | AISC
2019-08-11  [GAT] Graph Attention Networks | AISC Foundational
2019-08-06  XLNet: Generalized Autoregressive Pretraining for Language Understanding | AISC
2019-07-31  Overview of Generative Adversarial Networks | AISC
2019-07-29  Multi-Armed Bandit Strategies for Non-Stationary Reward Distributions and Delayed Feedback Processes
2019-07-22  AISC Abstract Night
2019-07-15  The Neuro-Symbolic Concept Learner: Interpreting Scenes, Words & Sentences From Natural Supervision