High-level planning with large language models - SayCan
How can we plan using a language model? SayCan offers a way to do long-horizon planning with a language model that is grounded in affordances, which determine which actions are feasible. The language model provides the flexibility of interpreting natural-language user input, while the affordance function ensures that only feasible actions are chosen.
This is a form of hierarchical planning, whereby each high-level action is a task for the lower-level systems to fulfil. Here, the lower-level systems are modular skills, either trained with reinforcement learning or pre-defined. To map these low-level skills to high-level planning, SayCan selects the right sequence of skills to fulfil the overall goal.
On a related note, the idea of grounding language models in affordances might also help constrain the outputs of language models such as ChatGPT to make them more reliable.
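For intuition on the selection step, here is a minimal Python sketch (not the authors' code): at each step, each candidate skill is scored by the language model's likelihood that it is a useful next step ("Say") multiplied by a learned value function's estimate that the skill can succeed from the current state ("Can"). The names llm_score, affordance_value and get_state are placeholders for illustration only.

import math

def saycan_plan(instruction, skills, get_state, llm_score, affordance_value, max_steps=10):
    # llm_score(instruction, plan_so_far, skill) -> log-likelihood of the skill text
    #   under the language model, given the instruction and the steps chosen so far ("Say").
    # affordance_value(state, skill) -> value-function estimate that the skill can be
    #   completed from the current state ("Can").
    plan = []
    for _ in range(max_steps):
        state = get_state()
        # Combined score: LM likelihood (exponentiated) times affordance value.
        scores = {
            skill: math.exp(llm_score(instruction, plan, skill)) * affordance_value(state, skill)
            for skill in skills
        }
        best = max(scores, key=scores.get)
        plan.append(best)
        # In the real system the robot executes the chosen skill here before re-scoring.
        if best == "done":  # assumes "done" is one of the candidate skills
            break
    return plan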
~~~~
Relevant Reading:
SayCan: https://say-can.github.io/
SayCan Presentation Slides: https://github.com/tanchongmin/TensorFlow-Implementations/blob/main/Paper_Reviews/SayCan%20Slides.pdf
Learning, Fast and Slow (my idea of using two systems for learning; SayCan can help with making arbitrary goals quantifiable): https://www.youtube.com/watch?v=Hr9zW7Usb7I
Inner Monologue (using self-thought to improve performance): https://arxiv.org/abs/2207.05608
Socratic Models (multiple domain experts coming to a decision): https://arxiv.org/abs/2204.00598
Chain of Thought Reasoning: https://arxiv.org/abs/2201.11903
PaLM language model: https://arxiv.org/abs/2204.02311
Pick and Place architecture:
ViLD (to detect objects): https://arxiv.org/abs/2104.13921
CLIPort (action from text): https://cliport.github.io/
Transporter Networks (spatial symmetries for policies): https://transporternets.github.io/
~~~~
0:00 Introduction
2:19 Main Question
7:55 Preliminary Video
9:17 LLMs may not output valid actions
16:02 Language and Affordance
26:14 Training Value Function with Reinforcement Learning
32:50 SayCan Overview
35:48 Evaluating SayCan on Example Tasks
50:38 SayCan Overall Algorithm
1:02:43 Code Walkthrough
1:18:50 Kitchen Environment
1:22:53 Results
1:33:04 Chain-of-thought reasoning
1:49:36 Connection to Fast & Slow
1:55:45 Connection to Reliable GPT
1:58:08 Discussion
2:22:50 Conclusion
~~~~
AI and ML enthusiast. Likes to think about the essence behind breakthroughs in AI and explain it in a simple and relatable way. Also, I am an avid game creator.
Discord: https://discord.gg/fXCZCPYs
LinkedIn: https://www.linkedin.com/in/chong-min-tan-94652288/
Online AI blog: https://delvingintotech.wordpress.com/
Twitter: https://twitter.com/johntanchongmin
Try out my games here: https://simmer.io/@chongmin