LLM Agents as a System (Prelim Findings Sharing): An Attempt to solve a 2-player 2D Escape Room!

Subscribers:
5,330
Published on ● Video Link: https://www.youtube.com/watch?v=OQeBg7-pME4



Duration: 1:37:14
243 views
9


We started building this system as part of an LLM Agent Hackathon, and the idea is to have collaborative actions taken by LLM Agents to solve an overall task!

Richard shares his conversational framework where the agents converse with one another and with a coach to give feedback to the agents.

I share about my framework which uses a planner to plan macro-level actions for the two agents, and the two agents will then use these macro-level actions as their goal and plan micro-level actions. In order to save inference costs, the micro-level plan is in the form of a list of actions, which will be taken one at a time until completion before querying the planner again for a new goal.

We share tips about what works and what does not, as well as some suggested reason behind them. In short, navigation is not native to LLMs and we need to ground it in the environment space like move (1, 0) rather than just text - the main thing for LLM Agents is that there must be semantic grounding in the action space. The order of prompting also matters and the LLM Agent can be prompted to give more specific outputs if we prompt it in the right sequence for it to make decisions in a step by step (broad to specific) manner.

As improvement, we can consider giving the environment feedback to the agents for planning (like Voyager for Minecraft), and we can also have a better way of feeding context into the LLM, perhaps using context-dependent knowledge graphs.

~~~~~~~~~~~~~~~~~~~~~

Richard Github Repo: https://github.com/richard087/chatterbug
John Github Repo: https://github.com/tanchongmin/strictjson/tree/main/EscapeRoom

Voyager: https://www.youtube.com/watch?v=Y-pgbjTlYgk
Ghost in the MineCraft: https://www.youtube.com/watch?v=_VXOczXIkks

~~~~~~~~~~~~~~~~~~~~~

0:00 Introduction
1:06 Richard’s Work on Conversational Agents with a Coach
45:19 Step on two squares 2D Escape Room
1:13:36 Pick-Place 2D Escape Room

~~~~~~~~~~~~~~~~~~~~~

AI and ML enthusiast. Likes to think about the essences behind breakthroughs of AI and explain it in a simple and relatable way. Also, I am an avid game creator.

Discord: https://discord.gg/bzp87AHJy5
LinkedIn: https://www.linkedin.com/in/chong-min-tan-94652288/
Online AI blog: https://delvingintotech.wordpress.com/
Twitter: https://twitter.com/johntanchongmin
Try out my games here: https://simmer.io/@chongmin




Other Videos By John Tan Chong Min


2023-11-03AI & Education: RAG Question-Answer, Test Question Generator, Autograder by Kuang Wen! (Part 1)
2023-10-31A Roadmap for AI: Past, Present and Future (Part 1)
2023-10-28Tutorial #10: StrictJSON v2 (StrictText): Handle any output - quotation marks or backslash!
2023-10-24ChatDev: Can LLM Agents really replace a software company?
2023-10-17LLMs and Robotics: An Overview by Daniel Tan!
2023-10-17LLM Q&A #1: Prompting vs Fine-Tuning, More vs Fewer Sources for RAG, Prompting vs LLMs as a System
2023-10-10LLMs as a System of Multiple Expert Agents to solve the ARC Challenge (Detailed Walkthrough)
2023-09-26Everything about LLM Agents - Chain of Thought, Reflection, Tool Use, Memory, Multi-Agent Framework
2023-09-19Moving Beyond Probabilities: Memory as World Modelling
2023-09-05Symbolic Regression: Doing What LLMs cannot - Deriving Arbitrary Mathematical Relations!
2023-08-29LLM Agents as a System (Prelim Findings Sharing): An Attempt to solve a 2-player 2D Escape Room!
2023-08-23LLM as Pattern Machines(Part 2) - Goal Directed Decision Transformers, 10-Year Plan for Intelligence
2023-08-18Tutorial #9: Evolution Game v2: ChatGPT (Text) and Dall-E (Image) API Integration!
2023-08-17Tutorial #8: Create a Web Scraper using ChatGPT and Selenium!
2023-08-17Tutorial #7: Create a Chatbot with Gradio and ChatGPT!
2023-08-15LLMs as General Pattern Machines: Use Arbitrary Tokens to Pattern Match?
2023-08-08Tutorial #6: LangChain & StrictJSON Implementation of Knowledge Graph Question Answer with LLMs
2023-08-08Large Language Models and Knowledge Graphs: Merging Flexibility and Structure
2023-07-31Tutorial #5: SymbolicAI - Automatic Retrieval Augmented Generation, Multimodal Inputs, User Packages
2023-07-27How Llama 2 works: Ghost Attention, Quality Supervised Fine-tuning, RLHF for Safety and Helpfulness
2023-07-27Llama 2 vs ChatGPT