Sleep Time Compute - AI That "Thinks" 24/7 (Breakthrough)

Channel:

Matthew Berman

Subscribers:

477,000

Published on April 25, 2025 4:22:26 PM ● Video Link: https://www.youtube.com/watch?v=FGRxw3ACIkw

79,634 views

2,560

👉 Access all top AIs for $10 on https://mammouth.ai/

Join My Newsletter for Regular AI Updates 👇🏼
https://forwardfuture.ai/

My Links 🔗
👉🏻 Subscribe: / @matthew_berman
👉🏻 Twitter: https://twitter.com/matthewberman
👉🏻 Discord: https://discord.gg/xxysSXBxFW
👉🏻 Patreon: https://patreon.com/MatthewBerman
👉🏻 Instagram: https://www.instagram.com/matthewberman_ai
👉🏻 Threads: https://www.threads.net/@matthewberman_ai
👉🏻 LinkedIn: https://www.linkedin.com/company/forward-future-ai

Media/Sponsorship Inquiries ✅ https://bit.ly/44TC45VV

0:00 Intro: AI That Thinks BEFORE You Ask?
0:13 Introducing Sleep-Time Compute
0:59 The Problem with Standard Test-Time Compute (Cost & Latency)
2:58 Stateful LLM Applications (Code, Docs, Chat)
3:33 Sleep Time vs. Test Time (Diagram Explained)
4:51 Why Sleep-Time is More Cost-Effective
6:00 Defining Sleep-Time Compute
6:26 Sponsor: Mammouth (Generative AI Platform)
7:18 Paper Details: How They Tested Non-Reasoning Models
9:24 Benchmarking Sleep-Time (The Juggle Example)
10:05 Models Used (GPT-4o, Claude, DeepSeek, etc.)
10:25 Results: Non-Reasoning Models (Graphs)
12:18 Results: Reasoning Models (Graphs)
13:39 Sleep Time vs. Parallel Sampling (A Big Issue)
14:41 Scaling Sleep-Time Compute
15:45 Amortizing Cost Across Queries (Why it's Cheaper!)
16:48 Predictable Queries Benefit Most
18:04 Paper Summary & Future Directions
18:40 Outro & Newsletter

Other Videos By Matthew Berman

2025-05-02	The AI Energy CRISIS
2025-05-02	Open Source AI is getting insane
2025-05-02	Human Robots in YOUR Home
2025-05-01	Microsoft's CEO says SAAS IS DYING
2025-05-01	Microsoft's CEO says AI will replace Excel!
2025-05-01	Will AI Take Over Software Engineering? Microsoft Seems to Think So!
2025-05-01	Zuck’s Stunning Claim About Meta’s Self-Improving AI
2025-04-30	NEW Deepseek Model is INSANE at Math!
2025-04-29	Qwen3 is a fantastic open-source model
2025-04-28	Stark warning from Anthropic’s CEO
2025-04-25	Sleep Time Compute - AI That "Thinks" 24/7 (Breakthrough)
2025-04-23	Gemini 2.5 Flash has insane potential... (Google Keeps WINNING)
2025-04-18	The Industry Reacts to o3 and o4!
2025-04-18	AI News: Gemini 2.5 Flash, o3 and o4, Claude Research, Kling 2.0, and More!
2025-04-16	GPT-o4 is HERE - OpenAI is BACK!
2025-04-14	GPT-4.1 is HERE! The ultimate coding model
2025-04-13	AI News: OpenAI Dropping Tomorrow! Open Source o3 Level Model, Midjourney V7, and More!
2025-04-09	Google Cloud Next - Gemini 2.5 Pro EVERYWHERE
2025-04-08	“Thinking” AI might not actually think…
2025-04-07	Major Llama DRAMA
2025-04-06	The Industry Reacts to Llama 4 - "Nearly INFINITE"
