Torturing LLMs on Thanksgiving with "Mathematical Physics" by Donald H. Menzel (LIVE)

Subscribers:
16,500
Video Link: https://www.youtube.com/watch?v=JSK-jgHHkhc



2,151 views
86


I test some reasoning models with problems from the book "Mathematical Physics" by Donald H. Menzel. In this stream I mostly focus on giving them two problems:

1. Solving for the eigenvalues of a 4 x 4 matrix.
2. Calculating the Laplacian of a multivariable function in spherical coordinates.
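
For anyone who wants to check a model's answer to problems like these, here is a minimal Python sketch using only the standard library. It is not taken from the stream or from the book: the matrix `MATRIX`, the test functions, and the helper names `char_poly`, `evaluate`, and `spherical_laplacian` are all assumptions made for illustration. The first part builds the characteristic polynomial with the Faddeev-LeVerrier recurrence so that claimed eigenvalues can be tested exactly; the second part approximates the spherical Laplacian with central differences.

```python
from fractions import Fraction
from math import sin, cos


def matmul(a, b):
    """Multiply two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def char_poly(a):
    """Coefficients of det(x*I - A), highest power first (Faddeev-LeVerrier)."""
    n = len(a)
    a = [[Fraction(x) for x in row] for row in a]
    identity = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    coeffs = [Fraction(1)]
    m = identity
    for k in range(1, n + 1):
        am = matmul(a, m)
        c = -sum(am[i][i] for i in range(n)) / k  # next coefficient from the trace
        coeffs.append(c)
        m = [[am[i][j] + c * identity[i][j] for j in range(n)] for i in range(n)]
    return coeffs


def evaluate(coeffs, x):
    """Evaluate the polynomial at x with Horner's rule."""
    total = Fraction(0)
    for c in coeffs:
        total = total * x + c
    return total


def spherical_laplacian(f, r, theta, phi, h=1e-3):
    """Central-difference estimate of the Laplacian of f(r, theta, phi)."""
    f0 = f(r, theta, phi)
    radial = ((r + h / 2) ** 2 * (f(r + h, theta, phi) - f0)
              - (r - h / 2) ** 2 * (f0 - f(r - h, theta, phi))) / (h * h * r * r)
    polar = (sin(theta + h / 2) * (f(r, theta + h, phi) - f0)
             - sin(theta - h / 2) * (f0 - f(r, theta - h, phi))
             ) / (h * h * r * r * sin(theta))
    azimuthal = (f(r, theta, phi + h) - 2 * f0 + f(r, theta, phi - h)
                 ) / (h * h * r * r * sin(theta) ** 2)
    return radial + polar + azimuthal


# Hypothetical 4 x 4 matrix (not the one from the book).
MATRIX = [[2, 1, 0, 0],
          [1, 2, 0, 0],
          [0, 0, 0, 1],
          [0, 0, 1, 0]]

coeffs = char_poly(MATRIX)
# A claimed eigenvalue is correct exactly when the polynomial vanishes there.
for claimed in (3, 1, -1, 2):
    print(claimed, evaluate(coeffs, claimed) == 0)

# Hypothetical test function f = r^2, whose Laplacian is 6 everywhere.
print(spherical_laplacian(lambda r, t, p: r ** 2, 1.3, 0.7, 0.4))
```

Exact rational arithmetic keeps the eigenvalue check free of rounding error, but it only confirms rational candidates; irrational eigenvalues would need a numerical root finder instead. The finite-difference Laplacian is only approximate, so compare it to a model's closed-form answer within a tolerance rather than expecting exact agreement.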




Other Videos By Kyle Kabasares


2024-12-23 ChatGPT's Santa Mode Helps Me Wrap a Christmas Gift (Advanced Voice + Video Test)
2024-12-22 12/21/2024 Live Stream Re-Upload w/Timestamps: Sora, Veo 2, o3 ARC-AGI, Apollo Safety Research
2024-12-21 Live Discussion: 12 Days of OpenAI, o3 surpasses 75% on ARC-AGI, Gemini-Flash 2, etc
2024-12-20 What I Do for NASA + the Bay Area Environmental Research Institute
2024-12-16 Discussing o1-Pro's Performance on the 2024 Putnam Math Competition (LIVE)
2024-12-15 Sora kinda sucks...but it's HILARIOUS
2024-12-13 I'm Deleting Several LLM Math/Physics Testing Videos Today
2024-12-13 Follow Up to "I'm Deleting Several LLM Math/Physics Testing Videos Today"
2024-12-08 Reacting to OpenAI's Sora Release and Google's Quantum Computing Breakthrough (12/9/2024)
2024-12-02 PSA online angry men: Women don’t owe you anything. Thank you for coming to my TED talk.
2024-11-29 Torturing LLMs on Thanksgiving with "Mathematical Physics" by Donald H. Menzel (LIVE)
2024-11-23 Zettili’s quantum mechanics textbook is the #goat #physics #quantumphysics
2024-11-23 Are Any of These LLMs Smart Enough for Google?
2024-11-22 Could DeepSeek R1-Lite-Preview, o1-Preview, Claude Sonnet 3.5, or Gemini 1121 Work at Google? (LIVE)
2024-11-22 Gemini Experimental 1121 Did ~10 Weeks of Quantum Mechanics Research in ~10 Minutes
2024-11-21 I Made DeepSeek-R1-Lite-Preview and Gemini Experimental 1114 Solve This Integral
2024-11-19 Artificial Intelligence + Quantum Computing = ???
2024-11-15 Should I share my random late night thoughts more often? 😅 #AI #artificialintelligence #climate
2024-11-12 Life 3.0 by Max Tegmark is a must read for anyone interested in AI.
2024-11-09 I Used Perplexity AI Search for the First Time
2024-11-08 Reflecting on OpenAI’s o1-Full, preview, and mini Models