How Llama 2 works: Ghost Attention, Quality Supervised Fine-tuning, RLHF for Safety and Helpfulness

Channel:

John Tan Chong Min

Subscribers:

6,300

Published on July 27, 2023 7:57:41 AM ● Video Link: https://www.youtube.com/watch?v=koK48P7nx0Y

Category:

Let's Play

Duration: 1:20:18

1,221 views

We go through the various mechanisms behind Llama 2.
Pre-training: 2 trillion tokens
Supervised Fine-tuning: Tens of thousands of high quality samples
RLHF: To make outputs safer and more helpful
Ghost Attention: To help make the attention mechanism work for longer prompts

I do not agree with all of them, but overall Llama 2 is a great model to use!

~~~~~~~~~~~~~~~~~~~~~~~~~~~

Slides can be found here: https://github.com/tanchongmin/TensorFlow-Implementations/blob/main/Paper_Reviews/Llama%202.pdf
Part 1 here: https://www.youtube.com/watch?v=SBBFxwnABLM
How ChatGPT works: https://www.youtube.com/watch?v=wA8rjKueB3Q

Llama paper: https://arxiv.org/abs/2302.13971
Transformer Paper: https://arxiv.org/abs/1706.03762
Grouped Query Attention (GQA): https://arxiv.org/pdf/2305.13245.pdf
Rotary Positional Embeddings: https://arxiv.org/abs/2104.09864
Constitutional AI (Anthropic): https://arxiv.org/abs/2212.08073
RLHF Paper (OpenAI): https://arxiv.org/abs/2203.02155
Less is More for Alignment (LIMA): https://arxiv.org/abs/2305.11206
Phi-1 - Textbooks are all you Need (small but specialized model): https://arxiv.org/abs/2306.11644
Tiny Stories (small but specialized model): https://arxiv.org/abs/2305.07759

~~~~~~~~~~~~~~~~~~~~~~~~~~~

0:00 Ghost Attention
4:48 Llama 2 has the best Open Source Performance
7:43 Llama 2 vs Llama 1
11:23 Rotary Positional Embeddings (RoPE)
20:23 Overall Training Flow
21:11 Pre-training
25:50 Supervised Fine-Tuning (SFT)
32:52 Human Feedback to train Reward Models
47:14 Reinforcement Learning from Human Feedback (RLHF)
1:06:55 Discussion

~~~~~~~~~~~~~~~~~~~~~~~~~~~~

AI and ML enthusiast. Likes to think about the essences behind breakthroughs of AI and explain it in a simple and relatable way. Also, I am an avid game creator.

Discord: https://discord.gg/bzp87AHJy5
LinkedIn: https://www.linkedin.com/in/chong-min-tan-94652288/
Online AI blog: https://delvingintotech.wordpress.com/
Twitter: https://twitter.com/johntanchongmin
Try out my games here: https://simmer.io/@chongmin

Other Videos By John Tan Chong Min

2023-09-05	Symbolic Regression: Doing What LLMs cannot - Deriving Arbitrary Mathematical Relations!
2023-08-29	LLM Agents as a System (Prelim Findings Sharing): An Attempt to solve a 2-player 2D Escape Room!
2023-08-23	LLM as Pattern Machines(Part 2) - Goal Directed Decision Transformers, 10-Year Plan for Intelligence
2023-08-18	Tutorial #9: Evolution Game v2: ChatGPT (Text) and Dall-E (Image) API Integration!
2023-08-17	Tutorial #8: Create a Web Scraper using ChatGPT and Selenium!
2023-08-17	Tutorial #7: Create a Chatbot with Gradio and ChatGPT!
2023-08-15	LLMs as General Pattern Machines: Use Arbitrary Tokens to Pattern Match?
2023-08-08	Tutorial #6: LangChain & StrictJSON Implementation of Knowledge Graph Question Answer with LLMs
2023-08-08	Large Language Models and Knowledge Graphs: Merging Flexibility and Structure
2023-07-31	Tutorial #5: SymbolicAI - Automatic Retrieval Augmented Generation, Multimodal Inputs, User Packages
2023-07-27	How Llama 2 works: Ghost Attention, Quality Supervised Fine-tuning, RLHF for Safety and Helpfulness
2023-07-27	Llama 2 vs ChatGPT
2023-07-11	I-JEPA: Importance of Predicting in Latent Space
2023-07-09	Gen AI Study Group Introductory Tutorial - Transformers, ChatGPT, Prompt Engineering, Projects
2023-07-03	Tutorial #5: Strict JSON LLM Framework - Get LLM to output JSON exactly the way you want it!
2023-07-01	Tutorial #4: SymbolicAI ChatBot In-Depth Demonstration (Tool Use and Iterative Processing)
2023-06-29	How do we learn so fast? Towards a biologically plausible model for one-shot learning.
2023-06-20	LLMs as a system to solve the Abstraction and Reasoning Corpus (ARC) Challenge!
2023-06-16	Tutorial #3: Symbolic AI - Symbols, Operations, Expressions, LLM-based functions!
2023-06-13	No more RL needed! LLMs for high-level planning: Voyager + Ghost In the Minecraft
2023-06-06	Voyager - An LLM-based curriculum generator, actor and critic, with skill reuse in Minecraft!

Channel	Latest
CohhCarnage	10 hours ago
Farod Live [REDIFF - VOD]	10 hours ago
raocow	11 hours ago
CHAQN2	11 hours ago
cottagecheez	12 hours ago
Darl Apis	12 hours ago
KuyaDudz Vlog	12 hours ago
lugeyps3	13 hours ago
Donkey of Astora	13 hours ago
Permata Chanel	13 hours ago
WawanDKK	13 hours ago
bthomas96	13 hours ago
NRG-FLO Gaming	13 hours ago
NBC長崎放送	13 hours ago
Locon Gamer CLIPS	13 hours ago
ZackScottGames	13 hours ago
Fandy DS	13 hours ago
Tekken 8 Re Plays	13 hours ago
Ding Gamer	13 hours ago
Michelle eniva conde	13 hours ago
OPEN TV	13 hours ago
IGN	13 hours ago
이카리 iKARi	14 hours ago
VGAMA02	14 hours ago
ZebazPvD	14 hours ago