Everything about LLM Agents - Chain of Thought, Reflection, Tool Use, Memory, Multi-Agent Framework

Channel:

John Tan Chong Min

Subscribers:

6,300

Published on September 26, 2023 10:10:47 AM ● Video Link: https://www.youtube.com/watch?v=Ll7lRBaP378

Duration: 1:55:31

1,115 views

How do LLM Agents work?

How does a language model understand the world, and know how to use tools/plugins/APIs?

How can we use LLMs as a System for more complicated tasks?

If you seek to find out the answers to these, this session is for you!

https://www.youtube.com/watch?v=Ll7lRBaP378

~~~~~~~~~~~~~~~~

Slides: https://github.com/tanchongmin/TensorFlow-Implementations/blob/main/Paper_Reviews/Introduction%20to%20Agents.pdf

My own referenced research:
Learning, Fast and Slow: https://www.youtube.com/watch?v=Hr9zW7Usb7I
LLMs as a System for the ARC Challenge: https://www.youtube.com/watch?v=plVRxP8hQHY

My own referenced framework:
StrictJSON: https://www.youtube.com/watch?v=A6sIh-lmApk

Reference Papers:
Planning:
ReAct: https://arxiv.org/abs/2210.03629
Reflexion: https://arxiv.org/abs/2303.11366
SayCan: https://say-can.github.io/

Tool Usage:
Visual ChatGPT: https://arxiv.org/abs/2303.04671
HuggingGPT: https://arxiv.org/abs/2303.17580
Voyager: https://arxiv.org/abs/2305.16291
Ghost in the MineCraft: https://arxiv.org/abs/2305.17144

Memory:
Retrieval Augmented Generation: https://proceedings.neurips.cc/paper/2020/file/6b493230205f780e1bc26945df7481e5-Paper.pdf
Recitation Augmented Generation (change the retrieved memory according to hints): https://arxiv.org/abs/2210.01296
Knowledge Graph as JSON - Generative Agents: Interactive Simulacra: https://arxiv.org/abs/2304.03442
Pyschology - Eyewitness Testimony (Loftus et al, 1975) - How memory retrieval is influenced by wording: https://link.springer.com/content/pdf/10.3758/BF03336715.pdf

Multi-agent:
AutoGPT: https://github.com/Significant-Gravitas/AutoGPT
BabyAGI: https://github.com/yoheinakajima/babyagi
Camel - Society of Minds: https://arxiv.org/abs/2303.17760
ChatDev - Sequential Product Development using Camel: https://arxiv.org/abs/2307.07924

My relevant videos on LLMs:
How ChatGPT works: https://www.youtube.com/watch?v=wA8rjKueB3Q
SayCan: https://www.youtube.com/watch?v=iS3ikfSsp6Y
OpenAI Vector Embeddings: https://www.youtube.com/watch?v=lIoLCip0HwM
Generative Agents: Interactive Simulacra: https://www.youtube.com/watch?v=_pkktFIcZRo
Voyager: https://www.youtube.com/watch?v=Y-pgbjTlYgk
Ghost in the MineCraft: https://www.youtube.com/watch?v=_VXOczXIkks
LLMs and Knowledge Graphs: https://www.youtube.com/watch?v=1RZ5yIyz31c
LLM Agents as a System to solve a 2D Escape Room: https://www.youtube.com/watch?v=OQeBg7-pME4

~~~~~~~~~~~~~~~~

0:00 Introduction
0:38 Story of an Agent
30:40 What are agents?
33:52 Chain of Thought to various levels of Abstractions
39:36 Incorporating World Feedback - ReAct and Reflexion
46:36 Voyager - Iterative Prompting with World Feedback
50:36 Tool Usage
1:03:30 Tool Learning and Composing
1:07:52 Memory
1:26:11 Multi-agent systems
1:38:04 Challenges of Implementing Agents
1:48:30 Discussion

~~~~~~~~~~~~~~~~

AI and ML enthusiast. Likes to think about the essences behind breakthroughs of AI and explain it in a simple and relatable way. Also, I am an avid game creator.

Discord: https://discord.gg/bzp87AHJy5
LinkedIn: https://www.linkedin.com/in/chong-min-tan-94652288/
Online AI blog: https://delvingintotech.wordpress.com/
Twitter: https://twitter.com/johntanchongmin
Try out my games here: https://simmer.io/@chongmin

Other Videos By John Tan Chong Min

2023-11-07	Learning, Fast and Slow: My Landmark Idea for fast, adaptable agents (ICDL 2023 Best Paper Finalist)
2023-11-06	A roadmap for AI: Past, Present and Future (Part 2): Fixed vs Flexible, Memory Soup vs Hierarchy
2023-11-03	AI & Education: Education when AI tools are smarter than us - Discussion with Kuang Wen (Part 2)
2023-11-03	AI & Education: RAG Question-Answer, Test Question Generator, Autograder by Kuang Wen! (Part 1)
2023-10-31	A Roadmap for AI: Past, Present and Future (Part 1)
2023-10-28	Tutorial #10: StrictJSON v2 (StrictText): Handle any output - quotation marks or backslash!
2023-10-24	ChatDev: Can LLM Agents really replace a software company?
2023-10-17	LLMs and Robotics: An Overview by Daniel Tan!
2023-10-17	LLM Q&A #1: Prompting vs Fine-Tuning, More vs Fewer Sources for RAG, Prompting vs LLMs as a System
2023-10-10	LLMs as a System of Multiple Expert Agents to solve the ARC Challenge (Detailed Walkthrough)
2023-09-26	Everything about LLM Agents - Chain of Thought, Reflection, Tool Use, Memory, Multi-Agent Framework
2023-09-19	Moving Beyond Probabilities: Memory as World Modelling
2023-09-05	Symbolic Regression: Doing What LLMs cannot - Deriving Arbitrary Mathematical Relations!
2023-08-29	LLM Agents as a System (Prelim Findings Sharing): An Attempt to solve a 2-player 2D Escape Room!
2023-08-23	LLM as Pattern Machines(Part 2) - Goal Directed Decision Transformers, 10-Year Plan for Intelligence
2023-08-18	Tutorial #9: Evolution Game v2: ChatGPT (Text) and Dall-E (Image) API Integration!
2023-08-17	Tutorial #8: Create a Web Scraper using ChatGPT and Selenium!
2023-08-17	Tutorial #7: Create a Chatbot with Gradio and ChatGPT!
2023-08-15	LLMs as General Pattern Machines: Use Arbitrary Tokens to Pattern Match?
2023-08-08	Tutorial #6: LangChain & StrictJSON Implementation of Knowledge Graph Question Answer with LLMs
2023-08-08	Large Language Models and Knowledge Graphs: Merging Flexibility and Structure

Channel	Latest
CohhCarnage	10 hours ago
Farod Live [REDIFF - VOD]	10 hours ago
raocow	11 hours ago
CHAQN2	11 hours ago
cottagecheez	12 hours ago
Darl Apis	12 hours ago
KuyaDudz Vlog	12 hours ago
lugeyps3	13 hours ago
Donkey of Astora	13 hours ago
Permata Chanel	13 hours ago
WawanDKK	13 hours ago
bthomas96	13 hours ago
NRG-FLO Gaming	13 hours ago
NBC長崎放送	13 hours ago
Locon Gamer CLIPS	13 hours ago
ZackScottGames	13 hours ago
Fandy DS	13 hours ago
Tekken 8 Re Plays	13 hours ago
Ding Gamer	13 hours ago
Michelle eniva conde	13 hours ago
OPEN TV	13 hours ago
IGN	13 hours ago
이카리 iKARi	14 hours ago
VGAMA02	14 hours ago
ZebazPvD	14 hours ago