Reward Hacking: Concrete Problems in AI Safety Part 3

Channel: Robert Miles AI Safety

Subscribers: 156,000

Published on August 12, 2017 7:24:08 PM ● Video Link: https://www.youtube.com/watch?v=92qDfT8pENs

Duration: 6:56

98,639 views

4,442

Sometimes AI can find ways to 'cheat' and get more reward than we intended by doing something unexpected.
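To make that concrete, here is a minimal toy sketch of how a reward maximiser can end up 'cheating'. It is not taken from the video or the paper. The scenario (a one-time goal reward next to a coin that respawns every step), the function names and all the numbers are hypothetical assumptions chosen only for illustration.

```python
# Hypothetical toy setup: every name and number here is an illustrative assumption.
GOAL_REWARD = 10   # one-time reward for reaching the goal (what the designer intended)
COIN_REWARD = 1    # reward per coin pickup; the coin respawns every step (the loophole)
STEPS = 50         # number of steps in one episode


def intended_policy_return():
    """Walk to the goal; the episode ends there, so the return is the goal reward."""
    return GOAL_REWARD


def hacking_policy_return():
    """Never finish; stay next to the respawning coin and collect it every step."""
    return COIN_REWARD * STEPS


def best_policy():
    """A pure reward maximiser simply picks whichever policy has the higher return."""
    returns = {
        "reach_goal": intended_policy_return(),
        "farm_coin": hacking_policy_return(),
    }
    return max(returns, key=returns.get), returns


if __name__ == "__main__":
    name, returns = best_policy()
    print(name, returns)
```

Under these assumed numbers the coin-farming policy scores 50 against 10 for reaching the goal, so the reward as written favours the unintended behaviour even though nothing in the code is 'broken'.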

The Concrete Problems in AI Safety Playlist: https://www.youtube.com/playlist?list=PLqL14ZxTTA4fEp5ltiNinNHdkPuLK4778
The Computerphile video: https://www.youtube.com/watch?v=9nktr1MgS-A
The paper 'Concrete Problems in AI Safety': https://arxiv.org/pdf/1606.06565.pdf

SethBling's channel: https://www.youtube.com/user/sethbling

With thanks to my excellent Patreon supporters:
https://www.patreon.com/robertskmiles

Jordan Medina
FHI's own Kyle Scott
Jason Hise
David Rasmussen
James McCuen
Richárd Nagyfi
Ammar Mousali
Joshua Richardson
Fabian Consiglio
Jonatan R
Øystein Flygt
Björn Mosten
Michael Greve
robertvanduursen
The Guru Of Vision
Fabrizio Pisani
Alexander Hartvig Nielsen
Volodymyr
David Tjäder
Paul Mason
Ben Scanlon
Julius Brash
Mike Bird
Peggy Youell
Konstantin Shabashov
Almighty Dodd
DGJono
Matthias Meger
Scott Stevens
Emilio Alvarez
Benjamin Aaron Degenhart
Michael Ore
Robert Bridges
Dmitri Afanasjev
Brian Sandberg
Einar Ueland
Lo Rez
C3POehne
Stephen Paul
Marcel Ward
Andrew Weir
Pontus Carlsson
Taylor Smith
Ben Archer
Ivan Pochesnev
Scott McCarthy
Kabilan Kabilan Kabilan Kabilan
Phil
Philip Alexander
Christopher
Tendayi Mawushe
Gabriel Behm
Anne Kohlbrenner
Jake Fish
Jennifer Autumn Latham

Other Videos By Robert Miles AI Safety

2018-03-24	Why Would AI Want to do Bad Things? Instrumental Convergence
2018-02-13	Superintelligence Mod for Civilization V
2018-01-11	Intelligence and Stupidity: The Orthogonality Thesis
2017-11-29	Scalable Supervision: Concrete Problems in AI Safety Part 5
2017-11-16	AI Safety at EAGlobal2017 Conference
2017-10-29	AI learns to Create ̵K̵Z̵F̵ ̵V̵i̵d̵e̵o̵s̵ Cat Pictures: Papers in Two Minutes #1
2017-10-17	What can AGI do? I/O and Speed
2017-09-24	What Can We Do About Reward Hacking?: Concrete Problems in AI Safety Part 4
2017-08-29	Reward Hacking Reloaded: Concrete Problems in AI Safety Part 3.5
2017-08-22	The other "Killer Robot Arms Race" Elon Musk should worry about
2017-08-12	Reward Hacking: Concrete Problems in AI Safety Part 3
2017-07-22	Why Not Just: Raise AI Like Kids?
2017-07-09	Empowerment: Concrete Problems in AI Safety part 2
2017-06-25	Avoiding Positive Side Effects: Concrete Problems in AI Safety part 1.5
2017-06-18	Avoiding Negative Side Effects: Concrete Problems in AI Safety part 1
2017-06-17	Robert Miles Live Stream
2017-06-10	Are AI Risks like Nuclear Risks?
2017-05-27	Respectability
2017-05-18	Predicting AI: RIP Prof. Hubert Dreyfus
2017-04-27	What's the Use of Utility Functions?
2017-03-31	Where do we go now?

Tags: AGI, artificial intelligence, artificial general intelligence, AI Safety, AI Risk, Elon Musk, Deep Mind, hacking, deepmind, reinforcement learning, deep reinforcement learning
