LLMs can and do literally lie on purpose

Channel:

Subscribers:

Published on November 11, 2024 5:46:02 PM ● Video Link: https://www.youtube.com/watch?v=3HFrrL4-5ps

Duration: 0:00

623 views

It is recklessly dangerous to trust an LLM to help inform decisions in nearly ANY context - even a simple chatbot - without verifiable proof of the system prompt(s) and everything else that might be "in context", including fine-tuning guidelines/prompts, training techniques, and pre-training dataset content and usage. Even then, that's only the start of the problems. 🧾SOURCES:

1. hhttps://arxiv.org/abs/2311.07590- Large Language Models can Strategically Deceive their Users when Put Under Pressure

2. hhttps://www.sciencedirect.com/science/article/pii/S0267364922000292- The flaws of policies requiring human oversight of government algorithms

3. The stuff about "we need a new model of how oversight works" is mostly made up by Claude*. It SOUNDS nice, but that's not the same as checking whether it actually works. Remember, LLMs can and do literally lie- they complete the patterns their training and prompting leads them to. It feels likely, for one example, that Gemini and Claude have system prompting along the lines of "stay positive about the benefits of AI", for example, which would "color" everything they generate. Or, maybe there's as explicit as that, but they do AVOID including directions like "always warn people about the risks of AI", because they worry maybe that will make their product sell less well. The point is, we need to know all this stuff, AND more, for safe use.

---

Script and audio generated by NotebookLM: hhttps://notebooklm.google.com/Based on this essay generated by Claude: hhttps://pastebin.com/XAUMnBMV(I proofread it compared to the papers checking for any obvious mistakes, but I am not an AI researcher)
Visualized by MAZTR: hhttps://www.maztr.com/audiofilevisualizer
*Well, "made up" doesn't really apply to what LLMs do; it's not coming from nowhere, it comes a little bit from the papers, a little bit from how I prompted Claude to write the essay, and a little bit from what Claude is trained on; so in a way "we" made it up.

Other Videos By T

2024-11-11	LLMs can and do literally lie on purpose
2024-11-03	01 - Dying Dragon 🌠 Miss Macross: My Life as The Star - A Robotech Rock Opera by Lynn Minmay
2024-10-10	Introduction to Spellbook & RhythML \| VCV Rack
2024-06-09	jungle
2024-04-20	Spellbook Demo
2024-04-18	Spellbook - Tech Demo
2024-04-14	Watching the Warp Trails
2023-12-01	Rainfall
2023-10-15	Hades
2023-10-04	Phasor Array
2023-09-30	The Heart Machine
2023-09-22	The Road to Death Mountain
2023-09-21	Pulling in to Orbit
2023-09-20	NausicaA
2023-09-20	Downtown Backstreets
2023-09-17	Ghost Song
2023-09-16	Seeded Orchestra
2023-09-09	Recursive Seed
2023-09-08	NaissanceE
2023-08-22	Heyman
2022-04-03	OrangeLine DM Composer 009

Channel	Latest
伍妞有伍仔	6 hours ago
ЭЛЕКТРОСИЛА	6 hours ago
阿紅RedKai_遊戲頻道	6 hours ago
TENET Productions	6 hours ago
10 well	6 hours ago
Shotgun Chanel	6 hours ago
Gazeta Sporturilor	6 hours ago
Shap	6 hours ago
RILE	6 hours ago
TheRevPlays	6 hours ago
Brilio News	6 hours ago
GamingDose	6 hours ago
SRafaah	6 hours ago
Philadelphia Union	6 hours ago
Krow's Graveyard	6 hours ago
Space Battles 2020	6 hours ago
UNWIRE.HK	7 hours ago
iliasGaming	7 hours ago
DAZN Portugal	7 hours ago
舞亜	7 hours ago
Greyshot Productions	7 hours ago
Der Wilde Roland	7 hours ago
Softwerker	7 hours ago
ひろちゃんねる	7 hours ago
AlphaSniper97	7 hours ago