A Roadmap for AI: Past, Present and Future (Part 3) - Multi-Agent, Multiple Sampling and Filtering
In this third and final session, we move beyond the individual system - moving beyond a basic formulation like GPTs - and going to multi-agent, multi-population systems.
We will also talk about the importance of reflection and memory sharing between agents.
We will also debate whether Artificial General Intelligence / Artificial Super Intelligence and the Singularity can be achieved. If there were a catchphrase for AGI/ASI, I would propose it to be "Multiple Sampling and Filtering" within each agent, and across agents.
Recap of Session 1 (Past/Present AI systems): Expert Knowledge Systems (learning rules from experts), Supervised Learning (human-labelled data), Unsupervised Learning/Self-Supervised Learning (non human-labelled data), Foundational Models to learn from data and set a baseline for performance.
Recap of Session 2 (Present/Future AI systems): Merging Fixed Structure + Flexible Learning, Imbuing memory for learning. Memory can exist in hierarchical form, or in multiple parallel abstraction/latent spaces.
~~~~~~~~~~~~~~~~~
Slides: https://github.com/tanchongmin/TensorFlow-Implementations/blob/main/Paper_Reviews/A%20Roadmap%20for%20AI%20(Final).pdf
I believe multiple sampling and filtering is the key to self-improvement and AGI/ASI.
Here are some selected works:
Multiple sampling of memory to select best trajectory: Learning, Fast and Slow - Mine (https://arxiv.org/abs/2301.13758)
Multiple sampling of possible programs with different abstraction spaces: Multiple Expert Systems in ARC - Mine (https://arxiv.org/abs/2310.05146)
Monte Carlo Tree Search to sample multiple futures and learn from the best one: AlphaGo / AlphaZero - DeepMind (https://www.nature.com/articles/nature24270)
Multiple Code generation to solve competitive programming: AlphaCode - DeepMind (https://arxiv.org/abs/2203.07814)
Multi-agent population-based methods: AlphaStar - DeepMind (https://deepmind.google/discover/blog/alphastar-mastering-the-real-time-strategy-game-starcraft-ii/)
Multiple sampling by agents to solve multimodal image-text MineCraft environment: JARVIS-1 (https://arxiv.org/pdf/2311.05997.pdf)
~~~~~~~~~~~~~~~~~
Other References:
Generative Agents: https://www.youtube.com/watch?v=_pkktFIcZRo
OpenAI GPTs: https://openai.com/blog/introducing-gpts
Learning, Fast and Slow: https://www.youtube.com/watch?v=DSVFA7nmwHQ
Hierarchical Temporal Memory (HTM) by Numenta: https://www.numenta.com/resources/research-publications/papers/hierarchical-temporal-memory-white-paper/
LLMs as a System of Multiple Expert Agents to Solve the ARC Challenge: https://www.youtube.com/watch?v=sTvonsD5His
ChatDev (LLM agents to simulate software company): https://www.youtube.com/watch?v=5sXqpCIIuT8
Voyager (LLM agent to self-learn the MineCraft environment): https://www.youtube.com/watch?v=Y-pgbjTlYgk
~~~~~~~~~~~~~~~~~
0:00 Introduction
7:15 Learning through Reflection
18:00 Agent Overview
24:45 Basic Agent - GPTs
27:25 Multiple Agents within same system
33:08 Multiple Specialised Agents within same system
38:10 Learning Skills via Interaction with Environment
49:20 Collective Intelligence
56:29 Knowledge Sharing between Agents
1:13:40 Intelligence via Multiple Populations
1:16:03 Can AGI/ASI be achieved?
1:35:09 Can the singularity be reached?
1:39:58 Discussion
~~~~~~~~~~~~~~~~~
AI and ML enthusiast. Likes to think about the essences behind breakthroughs of AI and explain it in a simple and relatable way. Also, I am an avid game creator.
Discord: https://discord.gg/bzp87AHJy5
LinkedIn: https://www.linkedin.com/in/chong-min-tan-94652288/
Online AI blog: https://delvingintotech.wordpress.com/
Twitter: https://twitter.com/johntanchongmin
Try out my games here: https://simmer.io/@chongmin
Other Videos By John Tan Chong Min
Other Statistics
StarCraft Statistics For John Tan Chong Min
There are 431 views in 1 video for StarCraft. About an hours worth of StarCraft videos were uploaded to his channel, less than 0.58% of the total video content that John Tan Chong Min has uploaded to YouTube.