Sleep Time Compute - AI That "Thinks" 24/7 (Breakthrough)

Subscribers:
477,000
Video Link: https://www.youtube.com/watch?v=FGRxw3ACIkw



79,634 views
2,560


👉 Access all top AIs for $10 on https://mammouth.ai/

Join My Newsletter for Regular AI Updates 👇🏼
https://forwardfuture.ai/

My Links 🔗
👉🏻 Subscribe: /@matthew_berman
👉🏻 Twitter: https://twitter.com/matthewberman
👉🏻 Discord: https://discord.gg/xxysSXBxFW
👉🏻 Patreon: https://patreon.com/MatthewBerman
👉🏻 Instagram: https://www.instagram.com/matthewberman_ai
👉🏻 Threads: https://www.threads.net/@matthewberman_ai
👉🏻 LinkedIn: https://www.linkedin.com/company/forward-future-ai

Media/Sponsorship Inquiries ✅ https://bit.ly/44TC45VV

0:00 Intro: AI That Thinks BEFORE You Ask?
0:13 Introducing Sleep-Time Compute
0:59 The Problem with Standard Test-Time Compute (Cost & Latency)
2:58 Stateful LLM Applications (Code, Docs, Chat)
3:33 Sleep Time vs. Test Time (Diagram Explained)
4:51 Why Sleep-Time is More Cost-Effective
6:00 Defining Sleep-Time Compute
6:26 Sponsor: Mammouth (Generative AI Platform)
7:18 Paper Details: How They Tested Non-Reasoning Models
9:24 Benchmarking Sleep-Time (The Juggle Example)
10:05 Models Used (GPT-4o, Claude, DeepSeek, etc.)
10:25 Results: Non-Reasoning Models (Graphs)
12:18 Results: Reasoning Models (Graphs)
13:39 Sleep Time vs. Parallel Sampling (A Big Issue)
14:41 Scaling Sleep-Time Compute
15:45 Amortizing Cost Across Queries (Why it's Cheaper!)
16:48 Predictable Queries Benefit Most
18:04 Paper Summary & Future Directions
18:40 Outro & Newsletter