SymbolicAI / ExtensityAI Paper Overview (Part 2) - Evaluation Benchmark Discussion! VIDEO
Had a great technical breakdown of the evaluation benchmark of SymbolicAI, along with some of my suggestions and discussions with Marius!
Part 1 here: https://studio.youtube.com/video/FXx5sHsXh0I
It's open source, so do check out their repositories:
Paper - https://arxiv.org/abs/2402.00854
SymbolicAI - https://github.com/ExtensityAI/symbolicai
SymbolicAI Benchmark - https://github.com/ExtensityAI/benchmark
~~~
AI and ML enthusiast. Likes to think about the essences behind breakthroughs of AI and explain it in a simple and relatable way. Also, I am an avid game creator.
Discord: https://discord.gg/bzp87AHJy5
LinkedIn: https://www.linkedin.com/in/chong-min-tan-94652288/
Online AI blog: https://delvingintotech.wordpress.com/
Twitter: https://twitter.com/johntanchongmin
Try out my games here: https://simmer.io/@chongmin
Other Videos By John Tan Chong Min 2024-05-11 Empirical - Open Source LLM Evaluation UI 2024-05-07 TaskGen Ask Me Anything #1 2024-04-29 StrictJSON (LLM Output Parser) Ask Me Anything #1 2024-04-22 Tutorial #14: Write latex papers with LLMs such as Llama 3! 2024-04-16 SORA Deep Dive: Predict patches from text, images or video 2024-04-09 OpenAI CLIP Embeddings: Walkthrough + Insights 2024-03-26 TaskGen - LLM Agentic Framework that Does More, Talks Less: Shared Variables, Memory, Global Context 2024-03-18 CRADLE (Part 2): An AI that can play Red Dead Dedemption 2. Reflection, Memory, Task-based Planning 2024-03-11 CRADLE (Part 1) - AI that plays Red Dead Redemption 2. Towards General Computer Control and AGI 2024-03-05 TaskGen - A Task-based Agentic Framework using StrictJSON at the core 2024-02-27 SymbolicAI / ExtensityAI Paper Overview (Part 2) - Evaluation Benchmark Discussion! 2024-02-20 SymbolicAI / ExtensityAI Paper Overview (Part 1) - Key Philosophy Behind the Design - Symbols 2024-02-13 Embeddings Walkthrough (Part 2): Context-Dependent Embeddings, Shifting Embedding Space 2024-02-06 Embeddings Walkthrough (Part 1) - Bag of Words to word2vec to Transformer contextual embeddings 2024-01-29 V* - Better than GPT-4V? Iterative Context Refining for Visual Question Answer! 2024-01-23 AutoGen: A Multi-Agent Framework - Overview and Improvements 2024-01-09 AppAgent: Using GPT-4V to Navigate a Smartphone! 2024-01-08 Tutorial #13: StrictJSON, my first Python Package! - Get LLMs to output into a working JSON! 2023-12-20 "Are you smarter than an LLM?" game speedrun 2023-12-08 Is Gemini better than GPT4? Self-created benchmark - Fact Retrieval/Checking, Coding, Tool Use 2023-12-04 Learning, Fast and Slow: 10 Years Plan - Memory Soup, Hier. Planning, Emotions, Knowledge Sharing