The Linear Algebraic Structure of Word Meanings

Published on: 2016-06-13 ● Video Link: https://www.youtube.com/watch?v=gaVR3WnczOQ



Duration: 1:31:56


Word embeddings are often constructed with discriminative models such as deep nets and word2vec. Mikolov et al. (2013) showed that these embeddings exhibit linear structure that is useful for solving "word analogy tasks". Subsequently, Levy and Goldberg (2014) and Pennington et al. (2014) tried to explain why such linear structure should arise in embeddings derived from nonlinear methods. We provide a new generative-model "explanation" for various word embedding methods as well as the above-mentioned linear structure. It also gives a generative explanation of older vector space methods such as the PMI method of Church and Hanks (1990). The model makes surprising predictions (e.g., the spatial isotropy of word vectors), which are empirically verified. It also leads directly to a linear algebraic understanding of how a word embedding behaves when the word is polysemous (has multiple meanings), and to a method for recovering the different meanings from the embedding. This methodology and generative model may be useful for other NLP tasks and neural models. Joint work with Sanjeev Arora, Yuanzhi Li, Yingyu Liang, and Andrej Risteski (listed in alphabetical order).
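To make the "linear structure" concrete: an analogy question like "man is to king as woman is to ?" is typically answered by a nearest-neighbor search around vec(king) - vec(man) + vec(woman). The sketch below is a minimal illustration of that mechanism using cosine similarity; the three-dimensional `embeddings` dictionary is hypothetical placeholder data, not vectors from the talk, and the code is not the speakers' implementation.

```python
import numpy as np

# Hypothetical toy vectors; real word2vec/GloVe embeddings are
# ~100-300 dimensional and trained on large corpora.
embeddings = {
    "king":  np.array([0.8, 0.6, 0.1]),
    "queen": np.array([0.8, 0.1, 0.6]),
    "man":   np.array([0.2, 0.9, 0.1]),
    "woman": np.array([0.2, 0.2, 0.7]),
    "apple": np.array([0.1, 0.5, 0.5]),
}

def cosine(u, v):
    # Cosine similarity: closeness by angle, ignoring vector length.
    return u @ v / (np.linalg.norm(u) * np.linalg.norm(v))

def analogy(a, b, c, vocab=embeddings):
    """Answer "a is to b as c is to ?" via the offset vector b - a + c."""
    target = vocab[b] - vocab[a] + vocab[c]
    candidates = (w for w in vocab if w not in {a, b, c})
    return max(candidates, key=lambda w: cosine(vocab[w], target))

print(analogy("man", "king", "woman"))  # -> "queen" on this toy data
```

For reference, the PMI statistic of Church and Hanks (1990) mentioned in the abstract scores a word-context pair by PMI(w, c) = log( p(w, c) / (p(w) p(c)) ), estimated from corpus co-occurrence counts; rows of the resulting matrix serve as an older, count-based kind of word vector.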




Other Videos By Microsoft Research


2016-06-13  How Much Information Does a Human Translator Add to the Original and Multi-Source Neural Translation
2016-06-13  Opportunities and Challenges in Global Network Cameras
2016-06-13  Nature in the City: Changes in Bangalore over Time and Space
2016-06-13  Making Small Spaces Feel Large: Practical Illusions in Virtual Reality
2016-06-13  Machine Learning as Creative Tool for Designing Real-Time Expressive Interactions
2016-06-13  Recent Developments in Combinatorial Optimization
2016-06-13  Computational Limits in Statistical Inference: Hidden Cliques and Sum of Squares
2016-06-13  Coloring the Universe: An Insider's Look at Making Spectacular Images of Space
2016-06-13  Towards Understandable Neural Networks for High Level AI Tasks - Part 6
2016-06-13  The 37th UW/MS Symposium in Computational Linguistics
2016-06-13  The Linear Algebraic Structure of Word Meanings
2016-06-13  Machine Learning Algorithms Workshop
2016-06-13  Interactive and Interpretable Machine Learning Models for Human Machine Collaboration
2016-06-13  Improving Access to Clinical Data Locked in Narrative Reports: An Informatics Approach
2016-06-13  Representation Power of Neural Networks
2016-06-13  Green Security Games
2016-06-13  e-NABLE: A Global Network of Digital Humanitarians on an Infrastructure of Electronic Communications
2016-06-10  Microsoft Research New England: An introduction
2016-06-06  Python+Machine Learning tutorial - Data munging for predictive modeling with pandas and scikit-learn
2016-06-06  Symposium: Deep Learning - Xiaogang Wang
2016-06-06  Symposium: Deep Learning - Leon Gatys



Tags:
microsoft research
word2vec
algorithms
nlp