Embeddings Walkthrough (Part 1) - Bag of Words to word2vec to Transformer contextual embeddings

Channel: John Tan Chong Min

Subscribers: 5,450

Published on February 6, 2024 7:58:47 AM ● Video Link: https://www.youtube.com/watch?v=gVZryxJRdSY

Category: Walkthrough

Duration: 1:37:30

Views: 636

How did Large Language Models (LLMs) become so good at capturing the essence of text?

Key idea: Embeddings

Mark my words - embeddings and context-dependent embeddings are the key reason why transformers work so well!

We'll go through a historical overview of how embeddings are derived, starting from Bag of Words, moving to word2vec and then Transformers, and ending with some of my more recent experiments on context-dependent embeddings and multiple abstraction spaces!

~~~~
Part 2 here: https://www.youtube.com/watch?v=YZTThlPg0rU

My slides: https://github.com/tanchongmin/strictjson/blob/main/Experiments/Embeddings%20Walkthrough.pdf

Bag of words: https://www.researchgate.net/publication/338511771_An_Overview_of_Bag_of_WordsImportance_Implementation_Applications_and_Challenges
Word2vec: https://courses.cs.washington.edu/courses/csep517/20wi/slides/csep517wi20-WordEmbeddings.pdf
word2vec paper: https://arxiv.org/abs/1301.3781
Transformer paper: https://arxiv.org/abs/1706.03762
Vision Transformer paper: https://arxiv.org/abs/2010.11929
Memorizing Transformers paper (for that nice token prediction visualisation): https://arxiv.org/abs/2203.08913
Text and Code Embeddings by Contrastive Pre-training (OpenAI embeddings paper): https://arxiv.org/abs/2201.10005

~~~~

0:00 Introduction
1:30 Bag of words
5:55 Continuous vectors for embedding
8:42 word2vec
20:24 Next-token prediction
24:43 Transformer embeddings
43:17 Comparison: Image token embeddings
51:52 Recap on Transformer embeddings
55:47 Cosine Similarity
59:42 Sentence Embeddings
1:07:39 Why Contrastive Learning is Bad
1:11:11 Mismatch between next-token prediction and sentence meaning embedding prediction
1:19:18 Insight: Multiple Abstraction Space Prediction for Embeddings
1:27:03 Discussion

~~~~~~~~~~~~

AI and ML enthusiast. I like to think about the essence behind AI breakthroughs and explain it in a simple and relatable way. I am also an avid game creator.

Discord: https://discord.gg/bzp87AHJy5
LinkedIn: https://www.linkedin.com/in/chong-min-tan-94652288/
Online AI blog: https://delvingintotech.wordpress.com/
Twitter: https://twitter.com/johntanchongmin
Try out my games here: https://simmer.io/@chongmin

Other Videos By John Tan Chong Min

2024-04-22	Tutorial #14: Write latex papers with LLMs such as Llama 3!
2024-04-16	SORA Deep Dive: Predict patches from text, images or video
2024-04-09	OpenAI CLIP Embeddings: Walkthrough + Insights
2024-03-26	TaskGen - LLM Agentic Framework that Does More, Talks Less: Shared Variables, Memory, Global Context
2024-03-18	CRADLE (Part 2): An AI that can play Red Dead Redemption 2. Reflection, Memory, Task-based Planning
2024-03-11	CRADLE (Part 1) - AI that plays Red Dead Redemption 2. Towards General Computer Control and AGI
2024-03-05	TaskGen - A Task-based Agentic Framework using StrictJSON at the core
2024-02-27	SymbolicAI / ExtensityAI Paper Overview (Part 2) - Evaluation Benchmark Discussion!
2024-02-20	SymbolicAI / ExtensityAI Paper Overview (Part 1) - Key Philosophy Behind the Design - Symbols
2024-02-13	Embeddings Walkthrough (Part 2): Context-Dependent Embeddings, Shifting Embedding Space
2024-02-06	Embeddings Walkthrough (Part 1) - Bag of Words to word2vec to Transformer contextual embeddings
2024-01-29	V* - Better than GPT-4V? Iterative Context Refining for Visual Question Answer!
2024-01-23	AutoGen: A Multi-Agent Framework - Overview and Improvements
2024-01-09	AppAgent: Using GPT-4V to Navigate a Smartphone!
2024-01-08	Tutorial #13: StrictJSON, my first Python Package! - Get LLMs to output into a working JSON!
2023-12-20	"Are you smarter than an LLM?" game speedrun
2023-12-08	Is Gemini better than GPT4? Self-created benchmark - Fact Retrieval/Checking, Coding, Tool Use
2023-12-04	Learning, Fast and Slow: 10 Years Plan - Memory Soup, Hier. Planning, Emotions, Knowledge Sharing
2023-12-01	Tutorial #12: Use ChatGPT and off-the-shelf RAG on Terminal/Command Prompt/Shell - SymbolicAI
2023-11-20	JARVIS-1: Multi-modal (Text + Image) Memory + Decision Making with LLMs in MineCraft!
2023-11-20	Tutorial #11: Virtual Persona from Documents, Multi-Agent Chat, Text-to-Speech to hear your Personas
