Promptbreeder: Self-Referential Self-Improvement Via Prompt Evolution (Paper Explained)

Subscribers:
291,000
Published on ● Video Link: https://www.youtube.com/watch?v=tkX0EfNl4Fc



Duration: 46:45
33,980 views
956


#ai #promptengineering #evolution

Promptbreeder is a self-improving self-referential system for automated prompt engineering. Give it a task description and a dataset, and it will automatically come up with appropriate prompts for the task. This is achieved by an evolutionary algorithm where not only the prompts, but also the mutation-prompts are improved over time in a population-based, diversity-focused approach.

OUTLINE:
0:00 - Introduction
2:10 - From manual to automated prompt engineering
10:40 - How does Promptbreeder work?
21:30 - Mutation operators
36:00 - Experimental Results
38:05 - A walk through the appendix

Paper: https://arxiv.org/abs/2309.16797

Abstract:
Popular prompt strategies like Chain-of-Thought Prompting can dramatically improve the reasoning abilities of Large Language Models (LLMs) in various domains. However, such hand-crafted prompt-strategies are often sub-optimal. In this paper, we present Promptbreeder, a general-purpose self-referential self-improvement mechanism that evolves and adapts prompts for a given domain. Driven by an LLM, Promptbreeder mutates a population of task-prompts, and subsequently evaluates them for fitness on a training set. Crucially, the mutation of these task-prompts is governed by mutation-prompts that the LLM generates and improves throughout evolution in a self-referential way. That is, Promptbreeder is not just improving task-prompts, but it is also improving the mutationprompts that improve these task-prompts. Promptbreeder outperforms state-of-the-art prompt strategies such as Chain-of-Thought and Plan-and-Solve Prompting on commonly used arithmetic and commonsense reasoning benchmarks. Furthermore, Promptbreeder is able to evolve intricate task-prompts for the challenging problem of hate speech classification.

Authors: Chrisantha Fernando, Dylan Banarse, Henryk Michalewski, Simon Osindero, Tim Rocktäschel

Links:
Homepage: https://ykilcher.com
Merch: https://ykilcher.com/merch
YouTube: https://www.youtube.com/c/yannickilcher
Twitter: https://twitter.com/ykilcher
Discord: https://ykilcher.com/discord
LinkedIn: https://www.linkedin.com/in/ykilcher

If you want to support me, the best thing to do is to share out the content :)

If you want to support me financially (completely optional and voluntary, but a lot of people have asked for this):
SubscribeStar: https://www.subscribestar.com/yannickilcher
Patreon: https://www.patreon.com/yannickilcher
Bitcoin (BTC): bc1q49lsw3q325tr58ygf8sudx2dqfguclvngvy2cq
Ethereum (ETH): 0x7ad3513E3B8f66799f507Aa7874b1B0eBC7F85e2
Litecoin (LTC): LQW2TRyKYetVC8WjFkhpPhtpbDM4Vw7r9m
Monero (XMR): 4ACL8AGrEo5hAir8A9CeVrW8pEauWvnp1WnSDZxW7tziCDLhZAGsgzhRQABDnFy8yuM9fWJDviJPHKRjV4FWt19CJZN9D4n




Other Videos By Yannic Kilcher


2023-12-09Text Embeddings Reveal (Almost) As Much As Text
2023-12-03Scalable Extraction of Training Data from (Production) Language Models (Paper Explained)
2023-12-03Just Chatting (OpenAssistant Goodbye Stream)
2023-11-25What is Q-Learning (back to basics)
2023-11-23Greg & Sam are BACK! (+ Q-Star is AGI) (Also Memes)
2023-11-19Is Sam Altman coming back? (OpenAI drama continues)
2023-11-18OpenAI just fired CEO Sam Altman
2023-11-08I built the most expensive CPU ever! (Every instruction is a prompt)
2023-10-24OpenAssistant is Completed
2023-10-14Efficient Streaming Language Models with Attention Sinks (Paper Explained)
2023-10-07Promptbreeder: Self-Referential Self-Improvement Via Prompt Evolution (Paper Explained)
2023-09-12Retentive Network: A Successor to Transformer for Large Language Models (Paper Explained)
2023-09-03Reinforced Self-Training (ReST) for Language Modeling (Paper Explained)
2023-08-15[ML News] LLaMA2 Released | LLMs for Robots | Multimodality on the Rise
2023-08-14How Cyber Criminals Are Using ChatGPT (w/ Sergey Shykevich)
2023-08-13Recipe AI suggests FATAL CHLORINE GAS Recipe
2023-08-12DeepFloyd IF - Pixel-Based Text-to-Image Diffusion (w/ Authors)
2023-06-20[ML News] GPT-4 solves MIT Exam with 100% ACCURACY | OpenLLaMA 13B released
2023-06-06Tree-Ring Watermarks: Fingerprints for Diffusion Images that are Invisible and Robust (Explained)
2023-06-02RWKV: Reinventing RNNs for the Transformer Era (Paper Explained)
2023-05-23Tree of Thoughts: Deliberate Problem Solving with Large Language Models (Full Paper Review)



Tags:
deep learning
machine learning
arxiv
explained
neural networks
ai
artificial intelligence
paper
prompt engineering
think step by step
deepmind
google deepmind
google brain
prompts
llm prompts
automatic prompts
llm prmpt engineering