First Author Interview: AI & formal math (Formal Mathematics Statement Curriculum Learning)

Channel:

Yannic Kilcher

Subscribers:

291,000

Published on March 6, 2022 4:27:24 PM ● Video Link: https://www.youtube.com/watch?v=kl3aBni87jg

Duration: 58:08

7,534 views

227

#openai #math #imo

This is an interview with Stanislas Polu, research engineer at OpenAI and first author of the paper "Formal Mathematics Statement Curriculum Learning".
Watch the paper review here: https://youtu.be/lvYVuOmUVs8

OUTLINE:
0:00 - Intro
2:00 - How do you explain the big public reaction?
4:00 - What's the history behind the paper?
6:15 - How does algorithmic formal math work?
13:10 - How does expert iteration replace self-play?
22:30 - How is the language model trained and used?
30:50 - Why is every model fine-tuned on the initial state?
33:05 - What if we want to prove something we don't know already?
40:35 - How can machines and humans work together?
43:40 - Aren't most produced statements useless?
46:20 - A deeper look at the experimental results
50:10 - What were the high and low points during the research?
54:25 - Where do we go from here?

Paper: https://arxiv.org/abs/2202.01344
miniF2F benchmark: https://github.com/openai/miniF2F
Follow Stan here: https://twitter.com/spolu

Abstract:
We explore the use of expert iteration in the context of language modeling applied to formal mathematics. We show that at same compute budget, expert iteration, by which we mean proof search interleaved with learning, dramatically outperforms proof search only. We also observe that when applied to a collection of formal statements of sufficiently varied difficulty, expert iteration is capable of finding and solving a curriculum of increasingly difficult problems, without the need for associated ground-truth proofs. Finally, by applying this expert iteration to a manually curated set of problem statements, we achieve state-of-the-art on the miniF2F benchmark, automatically solving multiple challenging problems drawn from high school olympiads.

Authors: Stanislas Polu, Jesse Michael Han, Kunhao Zheng, Mantas Baksys, Igor Babuschkin, Ilya Sutskever

Links:
TabNine Code Completion (Referral): http://bit.ly/tabnine-yannick
YouTube: https://www.youtube.com/c/yannickilcher
Twitter: https://twitter.com/ykilcher
Discord: https://discord.gg/4H8xxDF
BitChute: https://www.bitchute.com/channel/yannic-kilcher
LinkedIn: https://www.linkedin.com/in/ykilcher
BiliBili: https://space.bilibili.com/2017636191

If you want to support me, the best thing to do is to share out the content :)

If you want to support me financially (completely optional and voluntary, but a lot of people have asked for this):
SubscribeStar: https://www.subscribestar.com/yannickilcher
Patreon: https://www.patreon.com/yannickilcher
Bitcoin (BTC): bc1q49lsw3q325tr58ygf8sudx2dqfguclvngvy2cq
Ethereum (ETH): 0x7ad3513E3B8f66799f507Aa7874b1B0eBC7F85e2
Litecoin (LTC): LQW2TRyKYetVC8WjFkhpPhtpbDM4Vw7r9m
Monero (XMR): 4ACL8AGrEo5hAir8A9CeVrW8pEauWvnp1WnSDZxW7tziCDLhZAGsgzhRQABDnFy8yuM9fWJDviJPHKRjV4FWt19CJZN9D4n

Other Videos By Yannic Kilcher

2022-03-26	Author Interview - Typical Decoding for Natural Language Generation
2022-03-25	Typical Decoding for Natural Language Generation (Get more human-like outputs from language models!)
2022-03-24	One Model For All The Tasks - BLIP (Author Interview)
2022-03-23	BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding&Generation
2022-03-21	[ML News] AI Threatens Biological Arms Race
2022-03-20	Active Dendrites avoid catastrophic forgetting - Interview with the Authors
2022-03-18	Avoiding Catastrophe: Active Dendrites Enable Multi-Task Learning in Dynamic Environments (Review)
2022-03-14	Author Interview - VOS: Learning What You Don't Know by Virtual Outlier Synthesis
2022-03-13	VOS: Learning What You Don't Know by Virtual Outlier Synthesis (Paper Explained)
2022-03-08	Spurious normativity enhances learning of compliance and enforcement behavior in artificial agents
2022-03-06	First Author Interview: AI & formal math (Formal Mathematics Statement Curriculum Learning)
2022-03-05	OpenAI tackles Math - Formal Mathematics Statement Curriculum Learning (Paper Explained)
2022-03-04	[ML News] DeepMind controls fusion \| Yann LeCun's JEPA architecture \| US: AI can't copyright its art
2022-03-02	AlphaCode - with the authors!
2022-03-01	Competition-Level Code Generation with AlphaCode (Paper Review)
2022-02-28	Can Wikipedia Help Offline Reinforcement Learning? (Author Interview)
2022-02-26	Can Wikipedia Help Offline Reinforcement Learning? (Paper Explained)
2022-02-23	[ML Olds] Meta Research Supercluster \| OpenAI GPT-Instruct \| Google LaMDA \| Drones fight Pigeons
2022-02-21	Listening to You! - Channel Update (Author Interviews)
2022-02-20	All about AI Accelerators: GPU, TPU, Dataflow, Near-Memory, Optical, Neuromorphic & more (w/ Author)
2022-02-18	[ML News] Uber: Deep Learning for ETA \| MuZero Video Compression \| Block-NeRF \| EfficientNet-X

Tags:

deep learning

machine learning

arxiv

explained

neural networks

artificial intelligence

paper

openai

formal math

ai math

ai math prover

machine learning for math

ml math

artificial intelligence math

ai mathematics

automated proof search

mini f2f

ai imo

ai math olympiad

openai mathematics

openai formal math

language models formal math

lean

lean prover

lean proof

lean math

ai lean environment

ai proves theorems

ai theorem prover

Channel	Latest
Roy The Gamer.	6 hours ago
HellfireComms	11 hours ago
penguinz0	12 hours ago
Zanar Aesthetics	13 hours ago
Svarush	14 hours ago
LongplayArchive	14 hours ago
Õhtuleht	14 hours ago
Pico Shogun	15 hours ago
Momoterasu	15 hours ago
Bass City	15 hours ago
ETwo4Three	15 hours ago
Henry Chhouk	15 hours ago
TueurDeBikette	15 hours ago
Suns	15 hours ago
Mati Clips	15 hours ago
Carlotta ASMR	16 hours ago
Shazam Sakazaki	16 hours ago
Cardboard Tube Knight	16 hours ago
ÉducaTube	16 hours ago
Jaegerchere	16 hours ago
lucas gameplays	16 hours ago
Darth Luke	16 hours ago
RobertIDK	16 hours ago
Ajarn Spencer	16 hours ago
Lazycorner07	16 hours ago