NeoPlanner - Continually Learning Planning Agent for Large Environments guided by LLMs

Channel:

John Tan Chong Min

Subscribers:

6,300

Published on July 23, 2024 9:10:39 AM ● Video Link: https://www.youtube.com/watch?v=5GVfsK1K0K4

Duration: 1:49:08

451 views

Excited to have Swarna present his work on state space graph-based Planning, and how he used an Explore-Exploit approach to build and use this state space graph for future action planning.

Speaker Profile - Swarna Kamal Paul completed his PhD on general problem solving agents from Jadavpur University on 2023. He also have 15 years of work experience in IT industry including consultancy, research and software application development. Currently he is affiliated to TCS India. His research interests includes adaptable agents, LLM based agentic systems, general AI, integrative AI.

Abstract - Sequential planning in large state space and action space quickly becomes intractable due to combinatorial explosion of the search space. To solve this problem I propose a hybrid agent - “neoplanner”, that synergizes both state space search with queries to foundational LLM to get the best action plan. The reward signals (wherever obtained) from the environment are quantitatively used to drive the search. In places where random exploration is needed, the LLM is queried to generate an action plan. Observations from environment during exploration are converted to learnings and stored in text format as memory. These are eventually used to refine the search. Experiments in the ScienceWorld environment reveals a 124% improvement from the current best method in terms of average reward gained across multiple tasks.

Paper - https://arxiv.org/abs/2312.07368
Code - https://github.com/swarna-kpaul/neoplanner

~~~

My related works referred to in this discussion:
TaskGen: https://www.youtube.com/watch?v=O_XyTT7QGH4
Learning, Fast and Slow: https://www.youtube.com/watch?v=DSVFA7nmwHQ

~~~

0:00 Introduction
1:03 What is Planning
6:05 Why is Planning hard?
17:45 ScienceWorld
20:51 Problems with Prior Methods
28:10 Problems with Planning with LLMs
32:03 Overall Method Overview
34:30 State Space Search
47:26 State Space Search Equations
1:04:05 Learning Value Functions via TD Learning
1:07:21 Overall Agent Architecture
1:15:10 Exploration Prompt
1:22:13 Experiment Details
1:24:27 Discussion

~~~

AI and ML enthusiast. Likes to think about the essences behind breakthroughs of AI and explain it in a simple and relatable way. Also, I am an avid game creator.

Discord: https://discord.gg/bzp87AHJy5
LinkedIn: https://www.linkedin.com/in/chong-min-tan-94652288/
Online AI blog: https://delvingintotech.wordpress.com/
Twitter: https://twitter.com/johntanchongmin
Try out my games here: https://simmer.io/@chongmin

Other Videos By John Tan Chong Min

2024-09-23	TaskGen Tutorial 3: Memory
2024-09-19	TaskGen Tutorial 2: Shared Variables and Global Context
2024-09-16	Beyond Strawberry: gpt-o1 - Is LLM alone sufficient for reasoning?
2024-09-11	TaskGen Tutorial 1: Agents and Equipped Functions
2024-09-11	TaskGen Tutorial 0: StrictJSON
2024-09-10	LLM-Modulo: Using Critics and Verifiers to Improve Grounding of a Plan - Explanation + Improvements
2024-09-06	TaskGen: Co-create the best open-sourced LLM Agentic Framework together!
2024-08-21	AriGraph (Part 2) - Knowledge Graph Construction and Retrieval Details
2024-08-13	alphaXiv - Share Ideas, Build Collective Understanding, Interact with ANY open sourced paper authors
2024-07-30	AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents
2024-07-23	NeoPlanner - Continually Learning Planning Agent for Large Environments guided by LLMs
2024-07-17	Intelligence = Sampling + Filtering
2024-07-12	Michael Hodel: Reverse Engineering the Abstraction and Reasoning Corpus
2024-07-02	TaskGen Conversational Class v2: JARVIS, Psychology Counsellor, Sherlock Holmes Shop Assistant
2024-06-04	CodeAct: Code As Action Space of LLM Agents - Pros and Cons
2024-05-28	TaskGen Conversation with Dynamic Memory - Math Quizbot, Escape Room Solver, Psychology Counsellor
2024-05-21	Integrate ANY Python Function, CodeGen, CrewAI tool, LangChain tool with TaskGen! - v2.3.0
2024-05-11	Empirical - Open Source LLM Evaluation UI
2024-05-07	TaskGen Ask Me Anything #1
2024-04-29	StrictJSON (LLM Output Parser) Ask Me Anything #1
2024-04-22	Tutorial #14: Write latex papers with LLMs such as Llama 3!

Channel	Latest
Yasuo Legends	7 hours ago
Jason's Video Games Source	7 hours ago
domisumReplay: Swain	8 hours ago
BaLoRi	9 hours ago
TG Plays	9 hours ago
Top5Gaming	10 hours ago
MrDalekJD	11 hours ago
Happy Animes Recaps	11 hours ago
JuegaGerman	11 hours ago
BlackySpeakz	12 hours ago
이스타TV	12 hours ago
SonAze Tails Fan	12 hours ago
BombCentralGaming	12 hours ago
The Serf Times	12 hours ago
DStrike	12 hours ago
capcomasia	12 hours ago
Thresh Challenger	12 hours ago
Wrexham AFC	12 hours ago
HellGhost Call of Duty	12 hours ago
JustSaySteven	12 hours ago
Misty Kathrine	12 hours ago
FraFilter	12 hours ago
Chupacabra Tutorials	12 hours ago
Andrew’s Movie Corner	13 hours ago
Vinicius 2Dánvi	13 hours ago