Can OpenAI's o1-preview Ace the 2023 Putnam Exam?

Subscribers:
17,000
Published on ● Video Link: https://www.youtube.com/watch?v=2xYosqVjROc



Duration: 0:00
3,200 views
127


I put the o1-preview and o1-mini model's math abilities to the test by giving them the 2023 Putnam Math Exam, supposedly the hardest math test given every year. This test is outside the model's training data, since their knowledge cutoff is October 2023 and the test was released in December 2023.

Putnam Archive: https://kskedlaya.org/putnam-archive/

** POST STREAM **
o1-preview's final score: 49/120.
Median Putnam exam taker's score: 10/120.

This would have placed o1-preview just outside the top 100 of all test takers (over 4000).

Take the final score with a grain of salt, I had o1-mini act as a grader and asked it to compare the official Putnam solution and grade it according to the rough Putnam scoring guidelines. Apparently one of my viewers tried the same thing and o1-preview would’ve scored higher in their case (top 50)




Other Videos By Kyle Kabasares


2024-10-21Google's Notebook LM Created a Podcast of My Physics PhD Thesis
2024-10-18Live from Worldcoin’s “A New World”: Keynote Address with Alex Blania and Sam Altman (OpenAI CEO)
2024-10-18I Went to Sam Altman’s Worldcoin Event As a Non-Techie
2024-10-17Live from WorldCoin’s: “A New World” in San Francisco, CA (Pre-Keynote, Breakfast)
2024-10-16The Time a Professor Said My Physics PhD Cohort Would Fail (Watch Before Entering a PhD Program)
2024-10-14I Took an Official Mensa Intelligence Test | An Honest Conversation About Intelligence
2024-10-13My Reaction to the 2024 Nobel Prize in Chemistry (as a non-chemist)
2024-10-11ChatGPT's Advanced Voice Speaks in My Grandmother's Filipino Language (Bisaya/Cebuano) Again!
2024-10-09Addendum to the Nobel Prize in Physics video, I acknowledge Prof. Hopfield’s physics background!
2024-10-08The 2024 #NobelPrize in #Chemistry was announced today! This is my 60-second reaction. #NobelWeek
2024-10-08Can OpenAI's o1-preview Ace the 2023 Putnam Exam?
2024-10-08My Reaction to the 2024 Nobel Prize in Physics as a Physics PhD Holder (Confused)
2024-10-07I Asked ChatGPT's Advanced Voice Mode to Sing Again...(Funny)
2024-10-06Getting Two ChatGPT Advanced Voices to Act in Multiple Roles (Comedy, Drama, Horror)
2024-10-05Testing Microsoft Copilot and ChatGPT-4o Canvas
2024-10-04Trying ChatGPT-4o with Canvas for the First Time!
2024-10-03I Made 2 ChatGPT Advanced Voice Models Roleplay with Each Other
2024-10-03Vlogging my OpenAI DevDay 2024 Experience!
2024-10-02Post-OpenAI Dev Day 2024 Thoughts
2024-10-02OpenAI Dev 2024 Day: Fireside chat with CEO Sam Altman
2024-09-30ChatGPT's Advanced Voice Speaks in My Grandmother's Filipino Dialect (Bisaya/Cebuano)