[ML News] GPT-4 solves MIT Exam with 100% ACCURACY | OpenLLaMA 13B released

Subscribers:
284,000
Published on ● Video Link: https://www.youtube.com/watch?v=Tkijsu129M0



Duration: 31:05
71,140 views
1,803


#gpt4 #mit #ai

A new paper claims to use GPT-4 to solve 100% of a set of MIT university exercises. Some people are skeptic and their investigations reveal more than one problem with this paper...

OUTLINE:
0:00 - ChatGPT gives out Windows 10 keys
0:30 - MIT exam paper
2:50 - Prompt engineering
5:30 - Automatic grading
6:45 - Response by other MIT students
8:30 - Unsolvable questions
10:50 - Duplicates
13:30 - Cascading the heuristics
22:40 - Other problems
29:25 - OpenLLaMA 13B published

References:
https://twitter.com/immasiddtweets/status/1669721470006857729/photo/1
https://arxiv.org/abs/2306.08997
https://arxiv.org/pdf/2306.08997.pdf
https://flower-nutria-41d.notion.site/No-GPT4-can-t-ace-MIT-b27e6796ab5a48368127a98216c76864
https://github.com/idrori/MITQ/commit/3feee1026318e537c0ad27968001ef76e4a36890
https://twitter.com/hardmaru/status/1670246674760077312
https://twitter.com/giffmana/status/1670258748286472193
https://twitter.com/T3816440886465/status/1670127224131862531
https://twitter.com/qrdl/status/1669856336652414977
https://www.chegg.com/homework-help/questions-and-answers/consider-mdp-set-possible-states-mathcal-s-0-1-2-3-set-possible-actions-mathcal-b-c--rewar-q111042613
https://github.com/openlm-research/open_llama
https://huggingface.co/openlm-research/open_llama_13b

Links:
Homepage: https://ykilcher.com
Merch: https://ykilcher.com/merch
YouTube: https://www.youtube.com/c/yannickilcher
Twitter: https://twitter.com/ykilcher
Discord: https://ykilcher.com/discord
LinkedIn: https://www.linkedin.com/in/ykilcher

If you want to support me, the best thing to do is to share out the content :)

If you want to support me financially (completely optional and voluntary, but a lot of people have asked for this):
SubscribeStar: https://www.subscribestar.com/yannickilcher
Patreon: https://www.patreon.com/yannickilcher
Bitcoin (BTC): bc1q49lsw3q325tr58ygf8sudx2dqfguclvngvy2cq
Ethereum (ETH): 0x7ad3513E3B8f66799f507Aa7874b1B0eBC7F85e2
Litecoin (LTC): LQW2TRyKYetVC8WjFkhpPhtpbDM4Vw7r9m
Monero (XMR): 4ACL8AGrEo5hAir8A9CeVrW8pEauWvnp1WnSDZxW7tziCDLhZAGsgzhRQABDnFy8yuM9fWJDviJPHKRjV4FWt19CJZN9D4n




Other Videos By Yannic Kilcher


2023-11-08I built the most expensive CPU ever! (Every instruction is a prompt)
2023-10-24OpenAssistant is Completed
2023-10-14Efficient Streaming Language Models with Attention Sinks (Paper Explained)
2023-10-07Promptbreeder: Self-Referential Self-Improvement Via Prompt Evolution (Paper Explained)
2023-09-12Retentive Network: A Successor to Transformer for Large Language Models (Paper Explained)
2023-09-03Reinforced Self-Training (ReST) for Language Modeling (Paper Explained)
2023-08-15[ML News] LLaMA2 Released | LLMs for Robots | Multimodality on the Rise
2023-08-14How Cyber Criminals Are Using ChatGPT (w/ Sergey Shykevich)
2023-08-13Recipe AI suggests FATAL CHLORINE GAS Recipe
2023-08-12DeepFloyd IF - Pixel-Based Text-to-Image Diffusion (w/ Authors)
2023-06-20[ML News] GPT-4 solves MIT Exam with 100% ACCURACY | OpenLLaMA 13B released
2023-06-06Tree-Ring Watermarks: Fingerprints for Diffusion Images that are Invisible and Robust (Explained)
2023-06-02RWKV: Reinventing RNNs for the Transformer Era (Paper Explained)
2023-05-23Tree of Thoughts: Deliberate Problem Solving with Large Language Models (Full Paper Review)
2023-05-21OpenAI suggests AI licenses (US Senate hearing on AI regulation w/ Sam Altman)
2023-05-12[ML News] Geoff Hinton leaves Google | Google has NO MOAT | OpenAI down half a billion
2023-04-27Scaling Transformer to 1M tokens and beyond with RMT (Paper Explained)
2023-04-15OpenAssistant RELEASED! The world's best open-source Chat AI!
2023-04-10AI Alignment Livestream (aka OpenAssistant "Just Chatting")
2023-04-06OpenAssistant First Models are here! (Open-Source ChatGPT)
2023-03-18The biggest week in AI (GPT-4, Office Copilot, Google PaLM, Anthropic Claude & more)



Tags:
deep learning
machine learning
arxiv
explained
neural networks
ai
artificial intelligence
paper