LLMs as a System of Multiple Expert Agents to solve the ARC Challenge (Detailed Walkthrough)

Channel: John Tan Chong Min
Subscribers: 6,300
Published: October 10, 2023 10:16:16 AM
Video Link: https://www.youtube.com/watch?v=sTvonsD5His
Category: Walkthrough
Duration: 1:27:55
Views: 884

Can GPT4 solve the ARC Challenge?

I have been working on this problem for half a year, and I present the LLM Agents as a System approach to solve it!

We use multiple views to represent the problem (pixel, object, grid), ground the action space with common primitive functions and conditional functions, and incorporate environment feedback as in Voyager.
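
To make the idea of multiple views concrete, here is a minimal Python sketch. It is not the code from the paper or the repository: the function names (`pixel_view`, `object_view`, `flip_horizontal`), the choice of 0 as the background colour, and the use of 4-connectivity are all assumptions made for illustration.

```python
from collections import deque

BACKGROUND = 0  # assumption: colour 0 is treated as background


def pixel_view(grid):
    """List every non-background cell as (row, col, colour)."""
    return [(r, c, v)
            for r, row in enumerate(grid)
            for c, v in enumerate(row)
            if v != BACKGROUND]


def object_view(grid):
    """Group 4-connected cells of the same colour into objects (hypothetical helper)."""
    height, width = len(grid), len(grid[0])
    seen = set()
    objects = []
    for r in range(height):
        for c in range(width):
            if grid[r][c] == BACKGROUND or (r, c) in seen:
                continue
            colour = grid[r][c]
            queue = deque([(r, c)])
            seen.add((r, c))
            cells = []
            # Breadth-first flood fill over same-coloured neighbours
            while queue:
                y, x = queue.popleft()
                cells.append((y, x))
                for dy, dx in ((1, 0), (-1, 0), (0, 1), (0, -1)):
                    ny, nx = y + dy, x + dx
                    if (0 <= ny < height and 0 <= nx < width
                            and (ny, nx) not in seen
                            and grid[ny][nx] == colour):
                        seen.add((ny, nx))
                        queue.append((ny, nx))
            objects.append({"colour": colour, "cells": sorted(cells)})
    return objects


def flip_horizontal(grid):
    """An example primitive function: mirror each row left to right."""
    return [row[::-1] for row in grid]


grid = [[1, 1, 0],
        [0, 0, 2],
        [0, 0, 2]]
print(pixel_view(grid))
print(object_view(grid))
print(flip_horizontal(grid))
```

Each view describes the same grid at a different level of abstraction, and a primitive such as `flip_horizontal` is the kind of small, named operation that could ground the action space.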

The result: 50 solves out of 111 training set problems (those that fit within a 3000-token context length, leaving room for environment feedback)!
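
The environment feedback idea can be sketched as a loop that runs a candidate program on the training pairs and passes any error or mismatch back to the proposer. This is a rough illustration under stated assumptions, not the implementation from the paper: `run_candidate`, `solve_with_feedback`, the required function name `transform_grid`, and the stub proposer that stands in for an LLM call are all hypothetical.

```python
def run_candidate(code, train_pairs):
    """Run candidate code on every training pair; return (ok, feedback message)."""
    namespace = {}
    try:
        exec(code, namespace)  # the candidate is expected to define transform_grid
        transform = namespace["transform_grid"]
    except Exception as err:
        return False, f"Code failed to load: {err!r}"
    for i, (inp, expected) in enumerate(train_pairs):
        try:
            actual = transform([row[:] for row in inp])  # copy so inputs stay intact
        except Exception as err:
            return False, f"Pair {i}: runtime error {err!r}"
        if actual != expected:
            return False, f"Pair {i}: expected {expected}, got {actual}"
    return True, "All training pairs passed"


def solve_with_feedback(propose, train_pairs, max_attempts=3):
    """Ask the proposer for code, feeding back the last failure each time."""
    feedback = None
    for _ in range(max_attempts):
        code = propose(feedback)
        ok, feedback = run_candidate(code, train_pairs)
        if ok:
            return code
    return None


# Stub proposer standing in for an LLM: returns canned candidates in order.
candidates = iter([
    "def transform_grid(g):\n    return g",
    "def transform_grid(g):\n    return [row[::-1] for row in g]",
])


def stub_propose(feedback):
    return next(candidates)


pairs = [([[1, 0], [2, 0]], [[0, 1], [0, 2]])]
solution = solve_with_feedback(stub_propose, pairs)
print(solution)
```

Here the first candidate fails on the training pair, the mismatch message becomes the feedback, and the second candidate passes. In a real system the proposer would include that feedback in its next prompt.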

~~~~~~~~~~~~~~~~
Paper: https://arxiv.org/abs/2310.05146
Slides: https://github.com/tanchongmin/TensorFlow-Implementations/blob/main/Paper_Reviews/LLMs%20as%20a%20System%20for%20the%20ARC%20Challenge.pdf
Github: https://github.com/tanchongmin/ARC-Challenge

Earlier ARC Videos:
LLMs as a System to Solve ARC (Part 1): https://www.youtube.com/watch?v=plVRxP8hQHY
Initial Experimentation using GPT4: https://www.youtube.com/watch?v=vt2yG1da8Fg

Language-complete ARC (LARC): https://arxiv.org/pdf/2106.07824.pdf
Michael Hodel's Domain-Specific Language: https://github.com/michaelhodel/arc-dsl
Voyager: https://arxiv.org/abs/2305.16291
LLMs and Object View: https://arxiv.org/abs/2305.18354

ARCathon: https://lab42.global/arcathon/
On The Measure of Intelligence: https://arxiv.org/abs/1911.01547

~~~~~~~~~~~~~~~~

0:00 Introduction
1:50 Methods to solve ARC
4:32 Transformers
19:28 Language ARC
20:40 Seven Steps to Greatness (Solving ARC)
21:10 Step 1: LARC
22:28 Step 2: Chain of Thought
26:53 Step 3: Code Generation
28:49 Step 4: Multiple Abstraction Spaces
40:55 Step 5: Grounding in Action Space
46:41 Step 6: Using Conditionals
50:26 Step 7: Environment Feedback
51:48 Using Memory for Learning
1:03:00 Results
1:05:25 Example Flow using problem d0f5fe59
1:09:21 Visual Results of Solves and Non-Solves
1:17:30 Potential Improvements
1:18:15 Discussion

~~~~~~~~~~~~~~~~

AI and ML enthusiast. I like to think about the essence behind AI breakthroughs and explain it in a simple and relatable way. I am also an avid game creator.

Discord: https://discord.gg/bzp87AHJy5
LinkedIn: https://www.linkedin.com/in/chong-min-tan-94652288/
Online AI blog: https://delvingintotech.wordpress.com/
Twitter: https://twitter.com/johntanchongmin
Try out my games here: https://simmer.io/@chongmin

Other Videos By John Tan Chong Min

2023-11-14	A Roadmap for AI: Past, Present and Future (Part 3) - Multi-Agent, Multiple Sampling and Filtering
2023-11-07	Learning, Fast and Slow: My Landmark Idea for fast, adaptable agents (ICDL 2023 Best Paper Finalist)
2023-11-06	A roadmap for AI: Past, Present and Future (Part 2): Fixed vs Flexible, Memory Soup vs Hierarchy
2023-11-03	AI & Education: Education when AI tools are smarter than us - Discussion with Kuang Wen (Part 2)
2023-11-03	AI & Education: RAG Question-Answer, Test Question Generator, Autograder by Kuang Wen! (Part 1)
2023-10-31	A Roadmap for AI: Past, Present and Future (Part 1)
2023-10-28	Tutorial #10: StrictJSON v2 (StrictText): Handle any output - quotation marks or backslash!
2023-10-24	ChatDev: Can LLM Agents really replace a software company?
2023-10-17	LLMs and Robotics: An Overview by Daniel Tan!
2023-10-17	LLM Q&A #1: Prompting vs Fine-Tuning, More vs Fewer Sources for RAG, Prompting vs LLMs as a System
2023-10-10	LLMs as a System of Multiple Expert Agents to solve the ARC Challenge (Detailed Walkthrough)
2023-09-26	Everything about LLM Agents - Chain of Thought, Reflection, Tool Use, Memory, Multi-Agent Framework
2023-09-19	Moving Beyond Probabilities: Memory as World Modelling
2023-09-05	Symbolic Regression: Doing What LLMs cannot - Deriving Arbitrary Mathematical Relations!
2023-08-29	LLM Agents as a System (Prelim Findings Sharing): An Attempt to solve a 2-player 2D Escape Room!
2023-08-23	LLM as Pattern Machines(Part 2) - Goal Directed Decision Transformers, 10-Year Plan for Intelligence
2023-08-18	Tutorial #9: Evolution Game v2: ChatGPT (Text) and Dall-E (Image) API Integration!
2023-08-17	Tutorial #8: Create a Web Scraper using ChatGPT and Selenium!
2023-08-17	Tutorial #7: Create a Chatbot with Gradio and ChatGPT!
2023-08-15	LLMs as General Pattern Machines: Use Arbitrary Tokens to Pattern Match?
2023-08-08	Tutorial #6: LangChain & StrictJSON Implementation of Knowledge Graph Question Answer with LLMs
