HyperTree Proof Search - Automated Theorem Proving with AlphaZero and Transformers!
Ever wondered what it is like to use AlphaZero style techniques and Transformers to solve mathematical theorems? HyperTree Proof Search by Meta is here. This employs the Transformer architecture as the Policy and Value networks for AlphaZero, with the goal to be solved as the prompt. The policy network predicts the next action (tactic), and is able to output tactics of variable lengths. The value network predicts the probability of a given goal being solved, and is restricted to outputting "Provable" and "Unprovable" as a next token, akin to a classification problem, and the value is given by the probability it will output the "Provable" token!! These are all pretty interesting ideas to extend AlphaZero to a search problem with infinite number of actions.
Solving automated theorem proving might be a first step to helping to build architectures capable of reasoning and hence explainable neural networks. There is a use for such architectures, but the caveat is that it can be very compute intensive. No wonder evolution does not automatically endow us with a purely logical brain - it will be too energy intensive and time intensive to derive decisions for our survival.
Paper can be found at: https://arxiv.org/abs/2205.11491
Slides can be found at: https://github.com/tanchongmin/TensorFlow-Implementations/blob/main/Paper_Reviews/Hypertree%20Proof%20Search%20Slides.pdf
Video on AlphaGo / AlphaZero: https://www.youtube.com/watch?v=a4lrFMSwGWQ
Video on AlphaTensor: https://www.youtube.com/watch?v=6nYqyTzOQjw
0:00 Introduction and Motivation
4:53 Forwards and Backwards Proving Process
10:21 Key Insight of Theorem Proving
11:06 Sample Theorem in Lean
14:13 Lean Proof-tree
19:33 Lean Demo
26:55 Godel Incompleteness Theorem
28:30 Do humans think in logical statements?
33:05 AlphaZero Recap
43:44 HyperTree Proof Search Overall
58:23 HyperTree Proof Search Specifics
1:03:25 Policy and Value Networks (Transformers) Training
1:10:00 Difference from AlphaZero
1:12:28 Results on Lean
1:13:29 Discussion on HyperTree Proof Search Architecture
1:24:12 Discussion on logical thinking
1:42:48 Final words and how to train complex models efficiently
~~~~~~~~~~~~~~~~~~~~~~~~~~
AI and ML enthusiast. Likes to think about the essences behind breakthroughs of AI and explain it in a simple and relatable way. Also, I am an avid game creator.
Discord: https://discord.gg/fXCZCPYs
LinkedIn: https://www.linkedin.com/in/chong-min-tan-94652288/
Online AI blog: https://delvingintotech.wordpress.com/.
Twitter: https://twitter.com/johntanchongmin
Try out my games here: https://simmer.io/@chongmin