LLMs as General Pattern Machines: Using Arbitrary Tokens to Pattern Match?
(Part 1) This is an interesting paper, as it shows that LLMs can represent, manipulate, and extrapolate abstract, non-linguistic patterns. This is a notable finding, since the prevailing view has been that LLMs are merely good completers of semantically meaningful text.
However, I show that the methods used in this paper may not be ideal. Firstly, mapping inputs to random tokens is not a good strategy, as the semantic priors of those tokens still come into play and can influence the results!
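To make the concern concrete, here is a minimal sketch (not the paper's code; all names are illustrative) of the random-token setup: semantic labels in an in-context prompt are replaced by arbitrary tokens, yet those replacement tokens still carry their own pretrained priors.

```python
import random

# Semantic labels we want to hide from the model.
labels = ["positive", "negative"]

# Arbitrary replacement tokens -- but note these still carry their own
# pretrained semantic associations (e.g. "apple" vs "zebra"), which is
# exactly the concern raised above.
random_tokens = random.sample(["apple", "zebra", "quartz", "violet"], k=len(labels))
label_map = dict(zip(labels, random_tokens))

# Build an in-context prompt where the model must infer the
# input -> token pattern purely from the demonstrations.
demos = [("great movie", "positive"), ("terrible plot", "negative")]
prompt = "\n".join(f"Input: {x}\nLabel: {label_map[y]}" for x, y in demos)
prompt += "\nInput: enjoyable film\nLabel:"
print(prompt)  # send this to an LLM and read off the predicted token
```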
(Part 2) Moreover, a reward-conditioned approach like the Decision Transformer is still slow to learn. I propose a Goal-Directed Decision Transformer instead and show that it outperforms the method used in this paper!
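For contrast, here is a minimal sketch (assumed sequence formats, not the paper's or my actual implementation) of how a Decision Transformer conditions on return-to-go, versus a goal-directed variant that conditions on a target state instead of a scalar reward:

```python
def dt_sequence(trajectory, total_return):
    """Reward-conditioned: prefix each step with the return-to-go."""
    tokens, rtg = [], total_return
    for state, action, reward in trajectory:
        tokens += [f"R:{rtg}", f"s:{state}", f"a:{action}"]
        rtg -= reward  # return-to-go shrinks as reward is collected
    return " ".join(tokens)

def goal_directed_sequence(trajectory, goal_state):
    """Goal-conditioned: condition on a desired end state; no reward signal needed."""
    tokens = [f"g:{goal_state}"]
    for state, action, _ in trajectory:
        tokens += [f"s:{state}", f"a:{action}"]
    return " ".join(tokens)

# Toy trajectory: (state, action, reward) triples on a small grid.
traj = [((0, 0), "right", 1), ((1, 0), "up", 1)]
print(dt_sequence(traj, total_return=2))
print(goal_directed_sequence(traj, goal_state=(1, 1)))
```

The design point: the goal-conditioned format removes the need to estimate and propagate a reward signal, which is why it can be faster to learn from the same trajectories.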
Using an LLM to associate patterns will likely be an underpinning of intelligence. However, my view is that representing those patterns with abstract tokens is probably not the right approach.
~~~~~~~~~~~~~~~~~~~~~~~~~~~
Paper: https://arxiv.org/abs/2307.04721
Github: https://general-pattern-machines.github.io/
Slides: https://github.com/tanchongmin/TensorFlow-Implementations/blob/main/Paper_Reviews/LLMs%20as%20General%20Pattern%20Machines%20(Slides).pdf
Part 2, which contains my ideas on Goal-Directed Decision Transformers as well as a 10-year plan for intelligence: https://www.youtube.com/watch?v=rZ6hgFEe5nY
Random tokens as labels do not affect performance much? - Rethinking the Role of Demonstrations: What Makes In-Context Learning Work? https://arxiv.org/abs/2202.12837
Semantically wrong labels do affect models - Larger Language Models do In-Context Learning Differently: https://arxiv.org/abs/2303.03846
Decision Transformer: https://arxiv.org/abs/2106.01345
My Related Videos:
LLMs to solve ARC: https://www.youtube.com/watch?v=plVRxP8hQHY
Learning, Fast and Slow: https://www.youtube.com/watch?v=Hr9zW7Usb7I
~~~~~~~~~~~~~~~~~~~~~~~~~~~
0:00 Introduction
1:09 Three approaches
8:49 Can we use random tokens?
13:43 Experiments to show mapping to random tokens may not work well!
25:38 Wrong Semantics Affect Performance
30:53 Sequence Transformation - ARC Challenge
42:08 Sequence Transformation - Grasp Detection and Forward Dynamics Prediction
45:55 Sequence Completion
49:42 Sequence Improvement - Decision Transformers
54:59 Sequence Improvement - Cart Pole
1:04:42 Markov Decision Process
1:14:45 Sequence Prediction in Cart Pole
1:18:00 Token semantic priors affect output in Cart Pole
1:22:00 How to improve Cart Pole tokenisation
1:23:55 Teaser: Is learning reward necessary?
1:32:00 Discussion
~~~~~~~~~~~~~~~~~~~~~~~~~~~
AI and ML enthusiast. I like to think about the essence behind AI breakthroughs and explain it in a simple and relatable way. I am also an avid game creator.
Discord: https://discord.gg/bzp87AHJy5
LinkedIn: https://www.linkedin.com/in/chong-min-tan-94652288/
Online AI blog: https://delvingintotech.wordpress.com/
Twitter: https://twitter.com/johntanchongmin
Try out my games here: https://simmer.io/@chongmin