Challenges in Making LLMs Safe and Robust

Channel:

Simons Institute for the Theory of Computing

Subscribers:

69,500

Published on September 6, 2024 12:00:00 AM ● Video Link: https://www.youtube.com/watch?v=0_Ui4y_v0Mw

Duration: 0:00

307 views

Aditi Raghunathan (Carnegie Mellon University)
https://simons.berkeley.edu/talks/aditi-raghunathan-carnegie-mellon-university-2024-09-06
Special Year on Large Language Models and Transformers, Part 1 Boot Camp

Aditi Raghunathan (CMU)’s presentation in the Large Language Models and Transformers, Part I Boot Camp addresses the root causes of numerous safety concerns and wide-ranging attacks on current large language models. Using a simple illustrative problem, she walks us through several defense strategies and evaluates their strengths and weaknesses, drawing connections to the broader literature on safety and robustness in machine learning.

Other Videos By Simons Institute for the Theory of Computing

2024-10-02	The long path to \sqrt{d} monotonicity testers
2024-10-02	O(log log n) Passes is Optimal for Semi-Streaming Maximal Independent Set
2024-10-02	Distribution Learning Meets Graph Structure Sampling
2024-10-02	On the instance optimality of detecting collisions and subgraphs
2024-10-02	Low Degree Testing over the Reals
2024-10-02	Talk By Tal Herman
2024-10-02	Verifiable Data Science via Interactive Proofs
2024-10-02	Dana is NOT average
2024-10-02	Monotonicity testing, routing, and a theorem of Lehman and Ron
2024-10-02	Conditional Sampling for Distribution Testing
2024-09-05	Challenges in Making LLMs Safe and Robust
2024-09-04	Understanding and Steering Generative AI Systems
2024-08-04	On counting subgraphs and why counting seeds makes more sense (if one thinks about it clearly)
2024-08-04	Testing Intersectingness of Uniform Families or how Dana and I intersected
2024-08-04	Recent Developments in Testing Bounded-Degree Graphs
2024-08-04	Property Testing with Incomplete or Manipulated Inputs
2024-08-01	Are there graphs whose shortest path structure requires large edge weights?
2024-08-01	Graph Connectivity Using Star Contraction
2024-08-01	Agnostic Proper Learning of Monotone Functions: Beyond the Black-Box Correction Barrier
2024-08-01	Sparsifying Set Systems for Coverage Problems
2024-07-31	Recent Progress on Euclidean Spanners

Channel	Latest
るす	6 hours ago
Fafalo Fery	6 hours ago
Jackson Play	6 hours ago
JIMBOD	6 hours ago
zaklepka1	6 hours ago
Zalgiris Kaunas	6 hours ago
Jakeszee	6 hours ago
Rader Gaming	6 hours ago
DODO -PLAYER	6 hours ago
るあ	6 hours ago
Brilio News	6 hours ago
SketchUp	6 hours ago
咕叽沙雕动画	6 hours ago
Dane Nerro	7 hours ago
Sevilla FC	7 hours ago
SmokinReact	7 hours ago
MMOJACKX57	7 hours ago
Camille Baranda	7 hours ago
Eintracht Frankfurt	7 hours ago
またいち(うし) ~ ソロで遊ぶひと	7 hours ago
Fire Sim Nerd	7 hours ago
Hades Anokata	7 hours ago
Shotgun Chanel	7 hours ago
predatelichr	7 hours ago
Boreo y Gatuna	7 hours ago