LLMs as Pattern Machines (Part 2) - Goal-Directed Decision Transformers, 10-Year Plan for Intelligence
(Part 1) This is an interesting paper: it shows that LLMs can represent, manipulate, and extrapolate abstract, non-linguistic patterns. This is a notable finding, as we have long assumed that LLMs are merely good completers of text with some semantic meaning.
However, I argue that the methods used in this paper may not be ideal. Firstly, using random tokens is not a good strategy: the semantic priors attached to these tokens still get used and may influence the results!
(Part 2) Moreover, a reward-based approach, as used in Decision Transformers, still takes a long time to learn. I propose a Goal-Directed Decision Transformer instead and show that it outperforms the method used in this paper!
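To make the contrast concrete, here is a minimal sketch (my own hypothetical token layout, not the actual implementation from the paper or video) of how the conditioning signal fed to a Decision Transformer-style sequence model differs: reward conditioning prefixes each timestep with a return-to-go scalar, while goal conditioning supplies the desired goal state directly.

```python
# Hypothetical sketch: reward-conditioned vs goal-conditioned input sequences
# for a Decision Transformer-style model. Token layout is illustrative only.

def returns_to_go(rewards):
    """Suffix sums of rewards: the return-to-go at each timestep."""
    rtg, total = [], 0
    for r in reversed(rewards):
        total += r
        rtg.append(total)
    return list(reversed(rtg))

def reward_conditioned_sequence(states, actions, rewards):
    """Interleave (return-to-go, state, action) triples per timestep."""
    seq = []
    for g, s, a in zip(returns_to_go(rewards), states, actions):
        seq += [("rtg", g), ("state", s), ("action", a)]
    return seq

def goal_conditioned_sequence(states, actions, goal):
    """Interleave (goal, state, action) triples: the target is given directly."""
    seq = []
    for s, a in zip(states, actions):
        seq += [("goal", goal), ("state", s), ("action", a)]
    return seq

states, actions, rewards = [0, 1, 2], ["right", "right", "stay"], [1, 1, 0]
print(reward_conditioned_sequence(states, actions, rewards)[:3])
# → [('rtg', 2), ('state', 0), ('action', 'right')]
print(goal_conditioned_sequence(states, actions, goal=2)[:3])
# → [('goal', 2), ('state', 0), ('action', 'right')]
```

The intuition behind goal conditioning is that the model need not learn the mapping from scalar returns to desirable behaviour; the target state is stated explicitly in the prompt.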
I also provide a 10-year vision of how I think intelligent machines can be created, drawing on concepts from various papers and my own experiments.
Using an LLM to associate patterns will likely be an underpinning of intelligence. However, my view is that this approach of using abstract tokens is probably not the right one: we will still need to imbue the tokens with some form of semantic meaning for better performance.
~~~~~~~~~~~~~~~~~~~~~~
Paper: https://arxiv.org/abs/2307.04721
Slides: https://github.com/tanchongmin/TensorFlow-Implementations/blob/main/Paper_Reviews/LLMs%20as%20General%20Pattern%20Machines%20(Slides).pdf
My related videos:
LLMs as General Pattern Machines (Part 1): https://www.youtube.com/watch?v=qEoiLgdQC9k
Learning Fast and Slow: https://www.youtube.com/watch?v=Hr9zW7Usb7I
LLMs as a System to solve ARC: https://www.youtube.com/watch?v=plVRxP8hQHY
Decision Transformers: https://www.youtube.com/watch?v=AW7vHggnAps
Voyager (High level LLM-based planning in MineCraft to functional Code): https://www.youtube.com/watch?v=Y-pgbjTlYgk
Ghost in the MineCraft (High level LLM-based planning in MineCraft to list of functions to call): https://www.youtube.com/watch?v=_VXOczXIkks
HyperTree Proof Search: https://www.youtube.com/watch?v=CIGl2NboS9s
GATO (Generalist Agent by DeepMind): https://www.youtube.com/watch?v=ENspggRUs4U
Knowledge Graphs and LLMs: https://www.youtube.com/watch?v=1RZ5yIyz31c
Joint-Embedding Predictive Architecture (Yann LeCun): https://www.youtube.com/watch?v=M98OLk30dBk
Hierarchy and its use in AI: https://www.youtube.com/watch?v=1x049Dmxes0
~~~~~~~~~~~~~~~~~~~~~~~
0:00 Introduction and Recap
3:40 Decision Transformers
7:15 Is Learning Reward Necessary?
14:40 Can LLMs do logic?
29:45 Alternate Rewards for Learning (Goals)
32:10 Experiments of Reward vs Goal Conditioning (Goal Conditioning is Better)
43:08 Semantic-description based prompting improves outcomes!
47:20 How to incorporate goals into Cart Pole
53:43 My 10-Year Vision for Intelligence
1:13:38 Goal-Directed Decision Transformers (looking for collaborators)
1:15:11 Conclusion and Discussion
~~~~~~~~~~~~~~~~~~~~~~~
AI and ML enthusiast. Likes to think about the essences behind AI breakthroughs and explain them in a simple and relatable way. Also an avid game creator.
Discord: https://discord.gg/bzp87AHJy5
LinkedIn: https://www.linkedin.com/in/chong-min-tan-94652288/
Online AI blog: https://delvingintotech.wordpress.com/
Twitter: https://twitter.com/johntanchongmin
Try out my games here: https://simmer.io/@chongmin