Is GPT4 capable of self-improving? Are we heading for AGI or AI doom?

Subscribers:
5,330
Published on ● Video Link: https://www.youtube.com/watch?v=eu326Owrukg



Duration: 10:01
645 views
18


Recently, I have noticed plenty of arguments against AI and even a petition curtailing the training of LLMs bigger than GPT4. Will AI eventually lead us to our doom? Is GPT4 capable of self-improving?

I do a simple experiment on trying to find out the nth prime number as an illustration to show that the model is ultimately limited by its own generation. To improve it, it will need to be given the right teaching signal. Perhaps tools can help to improve it, but we will still be limited by the capabilities of the model and tools.

Rather than be worried by whether AGI will destroy us, I feel like we are still far from achieving AGI. AlphaZero only works on a narrow domain such as Go or chess, and it will take a long while more before we can get general self-improving algorithms. And that may not even be possible.

Furthermore, we can just avoid linking powerful AI to an action space which has negative implications for us, such as weapons. Instead of limiting technology, we can focus on how to use it to enhance our daily lives.

~~~~~~~~~~~~~~~~~~~~~~~

Related links:
Petition to pause GPT-4 experiments: https://futureoflife.org/open-letter/pause-giant-ai-experiments/
How ChatGPT works: https://www.youtube.com/watch?v=wA8rjKueB3Q
How Visual ChatGPT works (LLMs + Tools): https://www.youtube.com/watch?v=J1Xj0xXmtHU
AlphaGo/AlphaZero Introduction: https://youtu.be/a4lrFMSwGWQ
Learning, Fast and Slow (Memory to imbue better performance in AI systems): https://www.youtube.com/watch?v=Hr9zW7Usb7I


0:00 Introduction
1:05 Prime Number experiment
4:07 Discussion on Self-Improvement
5:16 Comparison to Alpha Zero
6:34 A way for improvement via tools
7:25 How to prevent AI Doom
8:23 Memory as the way ahead
9:25 Concluding note on AGI

~~~~~~~~~~~~~~~~~~~~~~~

AI and ML enthusiast. Likes to think about the essences behind breakthroughs of AI and explain it in a simple and relatable way. Also, I am an avid game creator.

Discord: https://discord.gg/fXCZCPYs
LinkedIn: https://www.linkedin.com/in/chong-min-tan-94652288/
Online AI blog: https://delvingintotech.wordpress.com/.
Twitter: https://twitter.com/johntanchongmin
Try out my games here: https://simmer.io/@chongmin




Other Videos By John Tan Chong Min


2023-05-02I created a Law Court Simulator with GPT4!
2023-05-02I created a Law Court Simulator with ChatGPT!
2023-04-25Creating a ChatGPT Harry Potter Text-based RPG game!
2023-04-25Learn from just Memory Storage and Retrieval: Generative Agents Interacting in Simulation!
2023-04-18The future is neuro-symbolic: Expressiveness of ChatGPT and generalizability of symbols (SymbolicAI)
2023-04-17Can GPT4 solve the Abstraction and Reasoning Corpus (ARC) Challenge Zero-Shot?
2023-04-12GPT4: Zero-shot Classification without any examples + Fine-tune with reflection
2023-04-11OpenAI Vector Embeddings - Talk to any book or document; Retrieval-Augmented Generation!
2023-04-11Tutorial #2: OpenAI Vector Embeddings and Pinecone for Retrieval-Augmented Generation
2023-04-04Creating JARVIS: ChatGPT + APIs - HuggingGPT, Memory-Augmented Context, Meta GPT structures
2023-04-02Is GPT4 capable of self-improving? Are we heading for AGI or AI doom?
2023-03-28How Visual ChatGPT works + Toolformer/Wolfram Alpha. LLMs with Tools/APIs/Plugins is the way ahead!
2023-03-21Tokenize any input, even continuous vectors! - Residual Vector Quantization - VALL-E (Part 2)
2023-03-07Using Transformers to mimic anyone's voice! - VALL-E (Part 1)
2023-02-28Learning Part-Whole Structure by Chunking - More Efficient than Deep Learning!!!
2023-02-21High-level planning with large language models - SayCan
2023-02-13Learning, Fast and Slow: Towards Fast and Adaptable Agents in Changing Environments
2023-02-07Using Logic Gates as Neurons - Deep Differentiable Logic Gate Networks!
2023-01-31Learn from External Memory, not just Weights: Large-Scale Retrieval for Reinforcement Learning
2023-01-17How ChatGPT works - From Transformers to Reinforcement Learning with Human Feedback (RLHF)
2023-01-09HyperTree Proof Search - Automated Theorem Proving with AlphaZero and Transformers!