Could DeepSeek R1-Lite-Preview, o1-Preview, Claude Sonnet 3.5, or Gemini 1121 Work at Google? (LIVE)
Channel:
Subscribers:
16,500
Published on ● Video Link: https://www.youtube.com/watch?v=aMsUpZjoYdk
I give some premier LLMs questions and variations of questions from the book, "Are You Smart Enough to Work At Google?"
Some interesting findings from this session were that even slightly modifying the question's phrasing and the numbers used in each question could be enough to throw the models off the correct path.
I hope you find this kind of testing interesting!
Other Videos By Kyle Kabasares
Tags:
Artificial Intelligence
ChatGPT
OpenAI
Microsoft
Copilot
AI
Claude
Anthropic
Meta
o1
o1-preview
o1-mini
OpenAI o1
Generative AI
Mathematics
Putnam Exam