Code DeepSeek V3 From Scratch in Python - Full Course

Subscribers:
10,700,000
Published on ● Video Link: https://www.youtube.com/watch?v=5avSMc79V-w



Duration: 0:00
70,279 views
2,625


This course is a comprehensive guide to understanding and implementing DeepSeek V3, a cutting-edge deep learning model. ‪@vukrosic‬ shares step-by-step coding instructions and theoretical insights.

🔗 paper - https://arxiv.org/pdf/2412.19437

💻 https://github.com/deepseek-ai/DeepSeek-V3/tree/main/inference - code by DeepSeek, just a few small changes made at the end of the video to Transformer class to for training, as this is for inference, so you need to make them manually or screenshot the video and ask AI to make the changes to this code

❤ ️ Try interactive AI courses we love, right in your browserhttps://scrimba.com/freeCodeCamp-AIAI (Made possible by a grant from our friends at Scrimba)

⭐ ️ Contents ⭐ ️
⌨ ️ (0:00:00) Intro
⌨ ️ (0:01:40) Attention Mechanism
⌨ ️ (0:13:34) Query, Key, Value
⌨ ️ (0:34:11) KV Cache
⌨ ️ (0:39:06) Multihead Latent Attention (MLA)
⌨ ️ (0:58:53) Coding MLA
⌨ ️ (1:28:41) RoPE
⌨ ️ (1:55:44) Coding KV Cache
⌨ ️ (2:00:25) MLA forward
⌨ ️ (2:28:24) MoE, Gate
⌨ ️ (2:49:25) Gate code
⌨ ️ (3:09:10) MoE code
⌨ ️ (3:28:36) Transformer Blocks

🎉 Thanks to our Champion and Sponsor supporters:
👾 Drake Milly
👾 Ulises Moralez
👾 Goddard Tan
👾 David MG
👾 Matthew Springman
👾 Claudio
👾 Oscar R.
👾 jedi-or-sith
👾 Nattira Maneerat
👾 Justin Hual

--

Learn to code fohttps://www.freecodecamp.org/ob: https://www.freecodecamp.org

Read hundrhttps://freecodecamp.org/newsg: https://freecodecamp.org/news




Other Videos By freeCodeCamp.org


2025-04-22Essential Machine Learning and AI Concepts Animated
2025-04-21From fast food worker to cybersecurity engineer with Tae'lur Alexis [Podcast #169]
2025-04-17Learn Laravel by Building a Medium Clone – Tutorial
2025-04-16Data Engineering with Python and AI/LLMs – Data Loading Tutorial
2025-04-15From Accountant to Data Engineer with Alyson La [Podcast #168]
2025-04-10Train Your Own LLM – Tutorial
2025-04-09Lynx Tutorial – JS Framework for Cross Platform Development
2025-04-08C++ Setup and Installation Tools – CMake, vcpkg, Docker & Copilot
2025-04-04From drop-out to software architect with Jason Lengstorf [Podcast #167]
2025-04-02Full Stack Instagram Clone with Laravel and MongoDB – Tutorial
2025-04-01Code DeepSeek V3 From Scratch in Python - Full Course
2025-03-28From broke musician to working dev. How college drop-out Ryan Furrer learned to code [Podcast #166]
2025-03-27Excel Formulas & Functions You Should Know [Full Course]
2025-03-25Microservices in Nest.js – JavaScript Tutorial
2025-03-21From hating coding to programming satellites at age 37 – Francesco Ciulla interview [Podcast #165]
2025-03-19Learn ANY Language with AI (Learn English, Learn Spanish, Learn Mandarin Chinese, and more)
2025-03-18Build a Full Stack AI Note Taking App with Next.js and Supabase – Tutorial
2025-03-14How to become a self-taught developer while supporting a family [Podcast #164]
2025-03-13AWS Cognito Course – Authentication and Authorization
2025-03-12JavaScript Essentials Course
2025-03-11DeepSeek R1 Theory Tutorial – Architecture, GRPO, KL Divergence