SymbolicAI / ExtensityAI Paper Overview (Part 2) - Evaluation Benchmark Discussion!

Channel:

John Tan Chong Min

Subscribers:

6,300

Published on February 27, 2024 2:37:17 PM ● Video Link: https://www.youtube.com/watch?v=JYipf3felQw

Category:

Discussion

Duration: 2:05:10

123 views

Had a great technical breakdown of the evaluation benchmark of SymbolicAI, along with some of my suggestions and discussions with Marius!

Part 1 here: https://studio.youtube.com/video/FXx5sHsXh0I

It's open source, so do check out their repositories:
Paper - https://arxiv.org/abs/2402.00854
SymbolicAI - https://github.com/ExtensityAI/symbolicai
SymbolicAI Benchmark - https://github.com/ExtensityAI/benchmark

~~~

AI and ML enthusiast. Likes to think about the essences behind breakthroughs of AI and explain it in a simple and relatable way. Also, I am an avid game creator.

Discord: https://discord.gg/bzp87AHJy5
LinkedIn: https://www.linkedin.com/in/chong-min-tan-94652288/
Online AI blog: https://delvingintotech.wordpress.com/
Twitter: https://twitter.com/johntanchongmin
Try out my games here: https://simmer.io/@chongmin

Other Videos By John Tan Chong Min

2024-05-11	Empirical - Open Source LLM Evaluation UI
2024-05-07	TaskGen Ask Me Anything #1
2024-04-29	StrictJSON (LLM Output Parser) Ask Me Anything #1
2024-04-22	Tutorial #14: Write latex papers with LLMs such as Llama 3!
2024-04-16	SORA Deep Dive: Predict patches from text, images or video
2024-04-09	OpenAI CLIP Embeddings: Walkthrough + Insights
2024-03-26	TaskGen - LLM Agentic Framework that Does More, Talks Less: Shared Variables, Memory, Global Context
2024-03-18	CRADLE (Part 2): An AI that can play Red Dead Dedemption 2. Reflection, Memory, Task-based Planning
2024-03-11	CRADLE (Part 1) - AI that plays Red Dead Redemption 2. Towards General Computer Control and AGI
2024-03-05	TaskGen - A Task-based Agentic Framework using StrictJSON at the core
2024-02-27	SymbolicAI / ExtensityAI Paper Overview (Part 2) - Evaluation Benchmark Discussion!
2024-02-20	SymbolicAI / ExtensityAI Paper Overview (Part 1) - Key Philosophy Behind the Design - Symbols
2024-02-13	Embeddings Walkthrough (Part 2): Context-Dependent Embeddings, Shifting Embedding Space
2024-02-06	Embeddings Walkthrough (Part 1) - Bag of Words to word2vec to Transformer contextual embeddings
2024-01-29	V* - Better than GPT-4V? Iterative Context Refining for Visual Question Answer!
2024-01-23	AutoGen: A Multi-Agent Framework - Overview and Improvements
2024-01-09	AppAgent: Using GPT-4V to Navigate a Smartphone!
2024-01-08	Tutorial #13: StrictJSON, my first Python Package! - Get LLMs to output into a working JSON!
2023-12-20	"Are you smarter than an LLM?" game speedrun
2023-12-08	Is Gemini better than GPT4? Self-created benchmark - Fact Retrieval/Checking, Coding, Tool Use
2023-12-04	Learning, Fast and Slow: 10 Years Plan - Memory Soup, Hier. Planning, Emotions, Knowledge Sharing

Channel	Latest
Kessben TV	6 hours ago
MANISH PURI ARMY	6 hours ago
Video Gamers Oasis	6 hours ago
Combine Force	6 hours ago
FruityOoty	6 hours ago
bycortes91	6 hours ago
Bellagio Gaming	6 hours ago
Vailskibum	6 hours ago
CurioX	6 hours ago
Kacchi Hakuteiken	6 hours ago
AkaLxndon	6 hours ago
Android Gaming	6 hours ago
berry	6 hours ago
DanescuOana&Daniel	6 hours ago
LeaoDaJustiça	6 hours ago
SL4M & Counter-Strike	6 hours ago
RAGMAN-50	6 hours ago
SEMTEX	7 hours ago
LINzzO	7 hours ago
Wander Crippi	7 hours ago
Arron Owen	7 hours ago
[Kokorov]	7 hours ago
DrBusyMan	7 hours ago
XCageGame	7 hours ago
Awwesome Ummz	7 hours ago