Learn from just Memory Storage and Retrieval: Generative Agents Interacting in Simulation!

Video Link: https://www.youtube.com/watch?v=_pkktFIcZRo
Duration: 1:48:30

Memory plays a huge role in our lives. What if I told you that just by changing your memories and initial biases, I could change your behavior?

That is just what a group of researchers at Stanford and Google Research did. They built a sandbox simulator like The Sims and initialized 25 agents with different personalities, relationships, and life goals. The agents run freely, using natural language to select actions and to converse with one another when they are in proximity and agree to talk. They accumulate information about the world, perform reflections to consolidate memory, and use memory retrieval to select actions (similar to Retrieval-Augmented Generation, except that actions are generated instead of text).
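The paper's memory retrieval ranks memories by a weighted sum of recency, importance, and relevance. Here is a minimal sketch of that idea; the field names, weights, and decay factor are illustrative, not the paper's exact values:

```python
import math

def recency(hours_since_access, decay=0.995):
    # Exponentially decay a memory's weight since it was last accessed
    return decay ** hours_since_access

def cosine(a, b):
    # Cosine similarity between two embedding vectors
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

def retrieval_score(memory, query_embedding, w=(1.0, 1.0, 1.0)):
    # Weighted sum of recency, importance (LLM-rated 1-10, normalized), relevance
    r = recency(memory["hours_since_access"])
    i = memory["importance"] / 10.0
    v = cosine(memory["embedding"], query_embedding)
    return w[0] * r + w[1] * i + w[2] * v

def retrieve(memories, query_embedding, k=3):
    # Return the top-k memories for the current query
    return sorted(memories,
                  key=lambda m: retrieval_score(m, query_embedding),
                  reverse=True)[:k]
```

The retrieved memories are then placed into the agent's prompt context before it selects its next action.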

Most importantly, everything is stored hierarchically. Memories build upon each other, world representations are symbolic and naturally form part-whole relationships, and action selection and daily planning are also done hierarchically, from coarse grain down to the details.
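That coarse-to-fine planning can be sketched as recursive decomposition: a broad plan item is repeatedly broken into shorter sub-steps. The `llm` function below is a hypothetical stand-in for any completion call that returns sub-steps; the prompt wording is illustrative:

```python
def decompose(task, llm, depth=0, max_depth=2):
    """Recursively break a coarse plan item into finer-grained steps.

    `llm` is a placeholder for a completion function that takes a
    prompt string and returns a list of sub-step descriptions.
    """
    if depth == max_depth:
        return {"task": task, "steps": []}          # leaf: fine-grained action
    substeps = llm(f"Break '{task}' into 2-3 shorter steps.")
    return {"task": task,
            "steps": [decompose(s, llm, depth + 1, max_depth)
                      for s in substeps]}
```

In the paper's setting, the top level is a rough daily plan, which is decomposed into hour-long chunks and then into 5-15 minute actions.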

This work is very interesting and paves the way for memory-based learning. There are still many improvements to be made: scoring the importance of a memory as classification instead of a numeric rating (yes, I tested it, and it works better), implementing a better forgetting mechanism based on Ebbinghaus' forgetting curve, removing redundant memories, adding a "conscious" stream that processes environmental interactions directly instead of just extracting from the memory stream, and many more. That said, I still like this work a lot, and it is synergistic with many of the concepts about memory I want to research.
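The Ebbinghaus-style forgetting mechanism mentioned above could look like the classic retention curve R = exp(-t/S), where t is elapsed time and S is a per-memory stability. A minimal sketch, with hypothetical field names and threshold:

```python
import math

def retention(hours_elapsed, stability):
    # Ebbinghaus forgetting curve: R = exp(-t / S).
    # Higher stability S means the memory decays more slowly.
    return math.exp(-hours_elapsed / stability)

def prune(memories, now, threshold=0.1):
    # Drop memories whose retention has fallen below the threshold.
    # Retrieval ("rehearsal") could bump `stability`, strengthening a memory.
    return [m for m in memories
            if retention(now - m["last_access"], m["stability"]) >= threshold]
```

This gives a principled alternative to keeping the memory stream unbounded: frequently retrieved (and thus strengthened) memories survive, while stale ones fall away.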

2023 will be all about memory, as we unlock the potential of Large Language Models (LLMs) with better grounding through memory.

~~~~~~~~~~~~~~~~~~~~

Paper: https://arxiv.org/abs/2304.03442
Paper Demo (Watch re-run of simulation): https://reverie.herokuapp.com/arXiv_Demo/
Slides: https://github.com/tanchongmin/TensorFlow-Implementations/blob/main/Paper_Reviews/Agent%20Simulation%20Slides.pdf
Harry Potter ChatGPT Text-based RPG game: https://www.youtube.com/watch?v=yF-bXyImLio&list=PLcORGO1bTgjWPj_wup5-XtFGNELuN35ou

A Github repo to recreate this (by mkturkcan, using Alpaca instead of OpenAI ChatGPT): https://github.com/mkturkcan/generative-agents

References:
Socratic Models (various modalities talk to each other in text): https://arxiv.org/abs/2204.00598
GPT4 can do some zero-shot classification: https://www.youtube.com/watch?v=C0Eug9XpcBo
GPT4 can zero-shot some of the Abstraction and Reasoning Corpus (ARC) Challenge: https://www.youtube.com/watch?v=vt2yG1da8Fg
OpenAI Vector embeddings: https://www.youtube.com/watch?v=lIoLCip0HwM
Learning, Fast and Slow (my own hypothesis of how we use memory): https://www.youtube.com/watch?v=Hr9zW7Usb7I

~~~~~~~~~~~~~~~~~~~~~

0:00 Introduction
1:45 Demo of Simulation (Overview)
5:08 Demo of Simulation (State Details and Memory)
14:00 How Agents are Prompted for Actions
20:14 Motivation for memory-based learning
23:08 Recap on Memory
26:44 Can we learn from just memories alone?
36:10 Is memory related to personality?
39:00 Overall Architecture of Generative Agents
42:30 The power of prompting: Agent character, memories and background
44:42 Inter-Agent Communication
47:09 Taking Control of an Agent
50:11 Initial prompting can lead to cascade of actions
52:28 Memory Retrieval for Planning
1:00:05 Memory Retrieval Limit and Memory Stream
1:03:42 Agent’s prompt to rate importance of memory (and how zero-shot classification is actually better)
1:08:17 Reflections to consolidate memory
1:11:41 Planning for overall grounding of agent’s actions
1:14:01 Trees of Reflection
1:17:28 Symbolic Representation: Perceiving and Acting
1:21:42 Obtaining fine-grained actions via recursive prompting
1:25:22 Is the memory retrieval good?
1:28:15 Are the actions generated plausible?
1:29:24 Personal insight: how to better store and retrieve memories
1:31:20 Discussion

~~~~~~~~~~~~~~~~

AI and ML enthusiast. Likes to think about the essences behind breakthroughs in AI and explain them in a simple and relatable way. Also, I am an avid game creator.

Discord: https://discord.gg/fXCZCPYs
LinkedIn: https://www.linkedin.com/in/chong-min-tan-94652288/
Online AI blog: https://delvingintotech.wordpress.com/
Twitter: https://twitter.com/johntanchongmin
Try out my games here: https://simmer.io/@chongmin
