AutoGen: A Multi-Agent Framework - Overview and Improvements

Subscribers:
5,330
Published on ● Video Link: https://www.youtube.com/watch?v=2VIoG4_r3-A



Duration: 2:01:19
1,910 views
58


AutoGen is a popular multi-agent framework by Microsoft that enables you to define your own agents by specifying the name and the description (as the system prompt) for each agent, and setting up a two-agent or group setting for conversation to happen.

You can integrate various custom replies via the register_reply function, which specify how to reply to specific triggers (e.g. specific agents, specific keywords in messages).

You can also integrate the agents with pre-defined tools/functions by specifying them in "llm-config" and registering the function via "register_function". See https://medium.com/@coldstart_coder/autogen-essentials-function-integration-for-smarter-agents-7c3b4a0fdc12 for details.

All in all, AutoGen is a great first start for a multi-agent framework. I personally find the prompt very lengthy, and the conversation framework very verbose. It would be better if we have a more streamlined workflow (perhaps a templated list of actions tailored to a particular use case) with shorter LLM prompts for it to be more performant.

At the last part of the presentation, Soham Sarkar (@Crit) demonstrated his working implementation of AutoGen for his own use case of adding a company into a database!

~~~~~

Github: https://github.com/microsoft/autogen
Paper: https://arxiv.org/abs/2308.08155

My template AutoGen notebook: https://github.com/tanchongmin/TensorFlow-Implementations/blob/main/Paper_Reviews/AutoGen/AutoGen.ipynb
My AutoGen Slides: https://github.com/tanchongmin/TensorFlow-Implementations/blob/main/Paper_Reviews/AutoGen/AutoGen%20Slides.pdf

For those interested in the StrictJSON framework I talked about at 34:18, here it is: https://github.com/tanchongmin/strictjson

~~~~~

0:00 Introduction and Code Walkthrough
23:54 Basic Agent Structure
32:04 Prompt for Agents
36:13 RAG on Demand Agent
39:08 Grounding Message Agent
46:31 Red-teaming and Blue-teaming Agents
48:26 Advanced Topics - Dynamic Conversation Flow
50:44 More Agents = Better?
53:27 Self-correcting code with a suite of agents
55:00 Pseudo Multi-Agent Conversation
57:45 Is conversation really necessary to do tasks?
59:02 Takeaways from AutoGen
1:03:34 Discussion
1:24:05 Demo by Sarkar!

~~~~~

AI and ML enthusiast. Likes to think about the essences behind breakthroughs of AI and explain it in a simple and relatable way. Also, I am an avid game creator.

Discord: https://discord.gg/bzp87AHJy5
LinkedIn: https://www.linkedin.com/in/chong-min-tan-94652288/
Online AI blog: https://delvingintotech.wordpress.com/
Twitter: https://twitter.com/johntanchongmin
Try out my games here: https://simmer.io/@chongmin




Other Videos By John Tan Chong Min


2024-04-09OpenAI CLIP Embeddings: Walkthrough + Insights
2024-03-26TaskGen - LLM Agentic Framework that Does More, Talks Less: Shared Variables, Memory, Global Context
2024-03-18CRADLE (Part 2): An AI that can play Red Dead Dedemption 2. Reflection, Memory, Task-based Planning
2024-03-11CRADLE (Part 1) - AI that plays Red Dead Redemption 2. Towards General Computer Control and AGI
2024-03-05TaskGen - A Task-based Agentic Framework using StrictJSON at the core
2024-02-27SymbolicAI / ExtensityAI Paper Overview (Part 2) - Evaluation Benchmark Discussion!
2024-02-20SymbolicAI / ExtensityAI Paper Overview (Part 1) - Key Philosophy Behind the Design - Symbols
2024-02-13Embeddings Walkthrough (Part 2): Context-Dependent Embeddings, Shifting Embedding Space
2024-02-06Embeddings Walkthrough (Part 1) - Bag of Words to word2vec to Transformer contextual embeddings
2024-01-29V* - Better than GPT-4V? Iterative Context Refining for Visual Question Answer!
2024-01-23AutoGen: A Multi-Agent Framework - Overview and Improvements
2024-01-09AppAgent: Using GPT-4V to Navigate a Smartphone!
2024-01-08Tutorial #13: StrictJSON, my first Python Package! - Get LLMs to output into a working JSON!
2023-12-20"Are you smarter than an LLM?" game speedrun
2023-12-08Is Gemini better than GPT4? Self-created benchmark - Fact Retrieval/Checking, Coding, Tool Use
2023-12-04Learning, Fast and Slow: 10 Years Plan - Memory Soup, Hier. Planning, Emotions, Knowledge Sharing
2023-12-01Tutorial #12: Use ChatGPT and off-the-shelf RAG on Terminal/Command Prompt/Shell - SymbolicAI
2023-11-20JARVIS-1: Multi-modal (Text + Image) Memory + Decision Making with LLMs in MineCraft!
2023-11-20Tutorial #11: Virtual Persona from Documents, Multi-Agent Chat, Text-to-Speech to hear your Personas
2023-11-14A Roadmap for AI: Past, Present and Future (Part 3) - Multi-Agent, Multiple Sampling and Filtering
2023-11-07Learning, Fast and Slow: My Landmark Idea for fast, adaptable agents (ICDL 2023 Best Paper Finalist)