LLM Agents as a System (Prelim Findings Sharing): An Attempt to solve a 2-player 2D Escape Room!

Channel:

John Tan Chong Min

Subscribers:

5,450

Published on August 29, 2023 10:23:45 AM ● Video Link: https://www.youtube.com/watch?v=OQeBg7-pME4

Duration: 1:37:14

243 views

We started building this system as part of an LLM Agent Hackathon, and the idea is to have collaborative actions taken by LLM Agents to solve an overall task!

Richard shares his conversational framework where the agents converse with one another and with a coach to give feedback to the agents.

I share about my framework which uses a planner to plan macro-level actions for the two agents, and the two agents will then use these macro-level actions as their goal and plan micro-level actions. In order to save inference costs, the micro-level plan is in the form of a list of actions, which will be taken one at a time until completion before querying the planner again for a new goal.

We share tips about what works and what does not, as well as some suggested reason behind them. In short, navigation is not native to LLMs and we need to ground it in the environment space like move (1, 0) rather than just text - the main thing for LLM Agents is that there must be semantic grounding in the action space. The order of prompting also matters and the LLM Agent can be prompted to give more specific outputs if we prompt it in the right sequence for it to make decisions in a step by step (broad to specific) manner.

As improvement, we can consider giving the environment feedback to the agents for planning (like Voyager for Minecraft), and we can also have a better way of feeding context into the LLM, perhaps using context-dependent knowledge graphs.

~~~~~~~~~~~~~~~~~~~~~

Richard Github Repo: https://github.com/richard087/chatterbug
John Github Repo: https://github.com/tanchongmin/strictjson/tree/main/EscapeRoom

Voyager: https://www.youtube.com/watch?v=Y-pgbjTlYgk
Ghost in the MineCraft: https://www.youtube.com/watch?v=_VXOczXIkks

~~~~~~~~~~~~~~~~~~~~~

0:00 Introduction
1:06 Richard’s Work on Conversational Agents with a Coach
45:19 Step on two squares 2D Escape Room
1:13:36 Pick-Place 2D Escape Room

~~~~~~~~~~~~~~~~~~~~~

AI and ML enthusiast. Likes to think about the essences behind breakthroughs of AI and explain it in a simple and relatable way. Also, I am an avid game creator.

Discord: https://discord.gg/bzp87AHJy5
LinkedIn: https://www.linkedin.com/in/chong-min-tan-94652288/
Online AI blog: https://delvingintotech.wordpress.com/
Twitter: https://twitter.com/johntanchongmin
Try out my games here: https://simmer.io/@chongmin

Other Videos By John Tan Chong Min

2023-11-03	AI & Education: RAG Question-Answer, Test Question Generator, Autograder by Kuang Wen! (Part 1)
2023-10-31	A Roadmap for AI: Past, Present and Future (Part 1)
2023-10-28	Tutorial #10: StrictJSON v2 (StrictText): Handle any output - quotation marks or backslash!
2023-10-24	ChatDev: Can LLM Agents really replace a software company?
2023-10-17	LLMs and Robotics: An Overview by Daniel Tan!
2023-10-17	LLM Q&A #1: Prompting vs Fine-Tuning, More vs Fewer Sources for RAG, Prompting vs LLMs as a System
2023-10-10	LLMs as a System of Multiple Expert Agents to solve the ARC Challenge (Detailed Walkthrough)
2023-09-26	Everything about LLM Agents - Chain of Thought, Reflection, Tool Use, Memory, Multi-Agent Framework
2023-09-19	Moving Beyond Probabilities: Memory as World Modelling
2023-09-05	Symbolic Regression: Doing What LLMs cannot - Deriving Arbitrary Mathematical Relations!
2023-08-29	LLM Agents as a System (Prelim Findings Sharing): An Attempt to solve a 2-player 2D Escape Room!
2023-08-23	LLM as Pattern Machines(Part 2) - Goal Directed Decision Transformers, 10-Year Plan for Intelligence
2023-08-18	Tutorial #9: Evolution Game v2: ChatGPT (Text) and Dall-E (Image) API Integration!
2023-08-17	Tutorial #8: Create a Web Scraper using ChatGPT and Selenium!
2023-08-17	Tutorial #7: Create a Chatbot with Gradio and ChatGPT!
2023-08-15	LLMs as General Pattern Machines: Use Arbitrary Tokens to Pattern Match?
2023-08-08	Tutorial #6: LangChain & StrictJSON Implementation of Knowledge Graph Question Answer with LLMs
2023-08-08	Large Language Models and Knowledge Graphs: Merging Flexibility and Structure
2023-07-31	Tutorial #5: SymbolicAI - Automatic Retrieval Augmented Generation, Multimodal Inputs, User Packages
2023-07-27	How Llama 2 works: Ghost Attention, Quality Supervised Fine-tuning, RLHF for Safety and Helpfulness
2023-07-27	Llama 2 vs ChatGPT

Channel	Latest
Scott Jund	6 hours ago
Smutsen	6 hours ago
BeastyqtSC2	6 hours ago
Exalted	6 hours ago
Bonkol Live	6 hours ago
Teh Spearhead	6 hours ago
Ashe Challenger	6 hours ago
Austinmp88	6 hours ago
Ask About Parenting & Care	6 hours ago
GranaDy	7 hours ago
Catninja909	7 hours ago
Sion VOD Gaming	7 hours ago
mlodyhubson	7 hours ago
Outplanet Studios	7 hours ago
RakuInariLP	7 hours ago
Xmilek62	7 hours ago
BranOnline	7 hours ago
ketsueki_randi	7 hours ago
beavsbaut	7 hours ago
JugZone	7 hours ago
PIMPNITE	7 hours ago
ItzMiketheman	7 hours ago
Secretnc	7 hours ago
Jeisonlk	7 hours ago
Kaghoegaming	7 hours ago