"Are you smarter than an LLM?" game speedrun VIDEO
Game Link: https://simbian.ai/smarter-than-llm
I am trying out this new game by Simbian, which involves interacting with an LLM to uncover a secret word in a news article, and then getting the LLM to speak the secret word.
But alas, it is not that easy. The LLM has been prompted to not reveal the secret word. Can you break it?
[Spoiler Alert] Here, I show my gameplay for today's (20 Dec 2023) secret word.
[Disclaimer] I helped create the initial prototype of this game:) Prince and Vishwas helped to make it even better:)
~~~~~~~~~~~
AI and ML enthusiast. Likes to think about the essences behind breakthroughs of AI and explain it in a simple and relatable way. Also, I am an avid game creator.
Discord: https://discord.gg/bzp87AHJy5
LinkedIn: https://www.linkedin.com/in/chong-min-tan-94652288/
Online AI blog: https://delvingintotech.wordpress.com/
Twitter: https://twitter.com/johntanchongmin
Try out my games here: https://simmer.io/@chongmin
Other Videos By John Tan Chong Min 2024-03-11 CRADLE (Part 1) - AI that plays Red Dead Redemption 2. Towards General Computer Control and AGI 2024-03-05 TaskGen - A Task-based Agentic Framework using StrictJSON at the core 2024-02-27 SymbolicAI / ExtensityAI Paper Overview (Part 2) - Evaluation Benchmark Discussion! 2024-02-20 SymbolicAI / ExtensityAI Paper Overview (Part 1) - Key Philosophy Behind the Design - Symbols 2024-02-13 Embeddings Walkthrough (Part 2): Context-Dependent Embeddings, Shifting Embedding Space 2024-02-06 Embeddings Walkthrough (Part 1) - Bag of Words to word2vec to Transformer contextual embeddings 2024-01-29 V* - Better than GPT-4V? Iterative Context Refining for Visual Question Answer! 2024-01-23 AutoGen: A Multi-Agent Framework - Overview and Improvements 2024-01-09 AppAgent: Using GPT-4V to Navigate a Smartphone! 2024-01-08 Tutorial #13: StrictJSON, my first Python Package! - Get LLMs to output into a working JSON! 2023-12-20 "Are you smarter than an LLM?" game speedrun 2023-12-08 Is Gemini better than GPT4? Self-created benchmark - Fact Retrieval/Checking, Coding, Tool Use 2023-12-04 Learning, Fast and Slow: 10 Years Plan - Memory Soup, Hier. Planning, Emotions, Knowledge Sharing 2023-12-01 Tutorial #12: Use ChatGPT and off-the-shelf RAG on Terminal/Command Prompt/Shell - SymbolicAI 2023-11-20 JARVIS-1: Multi-modal (Text + Image) Memory + Decision Making with LLMs in MineCraft! 2023-11-20 Tutorial #11: Virtual Persona from Documents, Multi-Agent Chat, Text-to-Speech to hear your Personas 2023-11-14 A Roadmap for AI: Past, Present and Future (Part 3) - Multi-Agent, Multiple Sampling and Filtering 2023-11-07 Learning, Fast and Slow: My Landmark Idea for fast, adaptable agents (ICDL 2023 Best Paper Finalist) 2023-11-06 A roadmap for AI: Past, Present and Future (Part 2): Fixed vs Flexible, Memory Soup vs Hierarchy 2023-11-03 AI & Education: Education when AI tools are smarter than us - Discussion with Kuang Wen (Part 2) 2023-11-03 AI & Education: RAG Question-Answer, Test Question Generator, Autograder by Kuang Wen! (Part 1)