Plug and Play Language Models: A Simple Approach to Controlled Text Generation | AISC

Video Link: https://www.youtube.com/watch?v=q3Q_LTetx9o



Duration: 2:36:38
3,018 views


For slides and more information on the paper, visit https://aisc.ai.science/events/2020-01-13

Discussion lead: Raheleh Makki
Discussion facilitator(s): Gordon Gibson, Royal Sequeira, and Salman Mohammed

Motivation:
Large transformer-based language models (LMs) trained on huge text corpora have shown unparalleled generation capabilities. However, controlling attributes of the generated language (e.g. switching topic or sentiment) is difficult without modifying the model architecture or fine-tuning on attribute-specific data, which entails the significant cost of retraining. We propose a simple alternative: the Plug and Play Language Model (PPLM) for controllable language generation, which combines a pretrained LM with one or more simple attribute classifiers that guide text generation without any further training of the LM. In the canonical scenario we present, the attribute models are simple classifiers consisting of a user-specified bag of words or a single learned layer with 100,000 times fewer parameters than the LM. Sampling entails a forward and backward pass in which gradients from the attribute model push the LM's hidden activations and thus guide the generation. Model samples demonstrate control over a range of topics and sentiment styles, and extensive automated and human-annotated evaluations show attribute alignment and fluency. PPLMs are flexible in that any combination of differentiable attribute models may be used to steer text generation, which will allow for diverse and creative applications beyond the examples given in this paper.
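To make the forward/backward steering idea concrete, below is a minimal PyTorch sketch of PPLM-style bag-of-words steering. It uses a toy linear "LM head" rather than a real pretrained LM, and the names (`lm_head`, `pplm_step`), the bag-of-words token ids, the step size `alpha`, and the iteration count are all illustrative assumptions, not the paper's actual implementation or hyperparameters. The point it shows is the mechanism: gradients of an attribute loss are taken with respect to the hidden activations, and the activations are nudged so the next-token distribution shifts toward the desired attribute.

```python
# Minimal sketch of PPLM-style bag-of-words steering (toy model, not the paper's code).
import torch
import torch.nn.functional as F

torch.manual_seed(0)

vocab_size, hidden_size = 100, 32

# Stand-in for the pretrained LM's output layer: hidden state -> next-token logits.
lm_head = torch.nn.Linear(hidden_size, vocab_size)

# Attribute model: a user-specified bag of words (token ids) defining the topic.
bag_of_words = torch.tensor([7, 13, 42])

def pplm_step(hidden, alpha=0.02, n_iters=3):
    """Perturb the hidden activations so the next-token distribution places
    more mass on the bag-of-words tokens (repeated forward + backward passes)."""
    delta = torch.zeros_like(hidden, requires_grad=True)
    for _ in range(n_iters):
        logits = lm_head(hidden + delta)                  # forward pass
        log_probs = F.log_softmax(logits, dim=-1)
        # Attribute loss: negative log-probability mass on the bag of words.
        loss = -log_probs[..., bag_of_words].logsumexp(-1).mean()
        loss.backward()                                   # backward pass
        with torch.no_grad():
            # Gradient descent on the attribute loss, applied to the activations.
            delta -= alpha * delta.grad / (delta.grad.norm() + 1e-12)
            delta.grad.zero_()
    return (hidden + delta).detach()

hidden = torch.randn(1, hidden_size)
steered = pplm_step(hidden)
before = F.softmax(lm_head(hidden), -1)[0, bag_of_words].sum().item()
after = F.softmax(lm_head(steered), -1)[0, bag_of_words].sum().item()
print(f"bag-of-words mass before: {before:.4f}, after: {after:.4f}")
```

In the actual PPLM setup the same kind of gradient step is applied to the LM's cached activations (the "history") at each decoding step, and the attribute model can equally be a small learned classifier instead of a bag of words.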




Other Videos By LLMs Explained - Aggregate Intellect - AI.SCIENCE


2020-02-20 Build and Deploy Machine Learning Models | MLOps Overview
2020-02-19 Quantifying the dynamics of failure across science, startups and security | AISC
2020-02-18 Deep Learning for Symbolic Mathematics | AISC
2020-02-12 Visualizing and measuring the geometry of BERT | AISC
2020-02-11 Attention is not not explanation + Character Eyes: Seeing Language through Character-Level Taggers |
2020-02-10 Single Headed Attention RNN: Stop Thinking With Your Head | AISC
2020-01-27 Identifying Big ML product opportunities inside Big organizations | AISC
2020-01-23 Machine Learning in Cyber Security, Overview | AISC
2020-01-22 BottleSum: Unsupervised & Self-supervised Sentence Summarization w/ Information Bottleneck Principle
2020-01-20 A Hybrid GA-PSO Method for Evolving Architecture and Short Connections of Deep Convolutional Neural
2020-01-13 Plug and Play Language Models: A Simple Approach to Controlled Text Generation | AISC
2020-01-08 Overview of Modern Anomaly and Novelty Detection | AISC
2020-01-06 Annotating Object Instances With a Polygon RNN | AISC
2019-12-11 Predicting translational progress in biomedical research | AISC
2019-12-09 AlphaStar explained: Grandmaster level in StarCraft II with multi-agent RL
2019-12-04 How Can We Be So Dense? The Benefits of Using Highly Sparse Representations | AISC
2019-12-02 [RoBERT & ToBERT] Hierarchical Transformers for Long Document Classification | AISC
2019-11-25 [OpenAI] Solving Rubik's Cube with a Robot Hand | AISC
2019-11-18 Top-K Off-Policy Correction for a REINFORCE Recommender System | AISC
2019-11-13 Overview of Unsupervised & Semi-supervised learning | AISC
2019-11-11 Building products for Continuous Delivery in Machine Learning | AISC