What Can We Do About Reward Hacking?: Concrete Problems in AI Safety Part 4

Channel:

Robert Miles AI Safety

Subscribers:

156,000

Published on September 24, 2017 12:09:54 PM ● Video Link: https://www.youtube.com/watch?v=13tZ9Yia71c

Duration: 9:38

109,736 views

4,699

Three different approaches that might help to prevent reward hacking.

New Side Channel with no content yet!: https://www.youtube.com/channel/UC4qH2AHly_RSRze1bUqSSNw
Where do we go now?: https://www.youtube.com/watch?v=vYhErnZdnso
Previous Video in the series: https://youtu.be/46nsTFfsBuc

The Concrete Problems in AI Safety Playlist: https://www.youtube.com/playlist?list=PLqL14ZxTTA4fEp5ltiNinNHdkPuLK4778
The Computerphile video: https://www.youtube.com/watch?v=4l7Is6vOAOA
The paper 'Concrete Problems in AI Safety': https://arxiv.org/pdf/1606.06565.pdf

With thanks to my excellent Patreon supporters:
https://www.patreon.com/robertskmiles

Steef
Sara Tjäder
Jason Strack
Chad Jones
Stefan Skiles
Katie Byrne
Ziyang Liu
Jordan Medina
Kyle Scott
Jason Hise
David Rasmussen
Heavy Empty
James McCuen
Richárd Nagyfi
Ammar Mousali
Scott Zockoll
Charles Miller
Joshua Richardson
Fabian Consiglio
Jonatan R
Øystein Flygt
Björn Mosten
Michael Greve
robertvanduursen
The Guru Of Vision
Fabrizio Pisani
A Hartvig Nielsen
Volodymyr
David Tjäder
Paul Mason
Ben Scanlon
Julius Brash
Mike Bird
Taylor Winning
Roman Nekhoroshev
Peggy Youell
Konstantin Shabashov
Dodd Almighty
DGJono
Matthias Meger
Scott Stevens
Emilio Alvarez
Michael Ore
Robert Bridges
Dmitri Afanasjev
Brian Sandberg
Einar Ueland
Lo Rez
C3POehne
Stephen Paul
Marcel Ward
Andrew Weir
Pontus Carlsson
Taylor Smith
Ben Archer
Ivan Pochesnev
Scott McCarthy
Kabs Kabs Kabs
Phil
Philip Alexander
Christopher
Tendayi Mawushe
Gabriel Behm
Anne Kohlbrenner
Jake Fish
Jennifer Autumn Latham
Filip
Bjorn Nyblad
Stefan Laurie
Tom O'Connor
Krethys
PiotrekM
Jussi Männistö
Matanya Loewenthal
Wr4thon

Other Videos By Robert Miles AI Safety

2018-06-24	Friend or Foe? AI Safety Gridworlds extra bit
2018-05-25	AI Safety Gridworlds
2018-03-31	Experts' Predictions about the Future of AI
2018-03-24	Why Would AI Want to do Bad Things? Instrumental Convergence
2018-02-13	Superintelligence Mod for Civilization V
2018-01-11	Intelligence and Stupidity: The Orthogonality Thesis
2017-11-29	Scalable Supervision: Concrete Problems in AI Safety Part 5
2017-11-16	AI Safety at EAGlobal2017 Conference
2017-10-29	AI learns to Create ̵K̵Z̵F̵ ̵V̵i̵d̵e̵o̵s̵ Cat Pictures: Papers in Two Minutes #1
2017-10-17	What can AGI do? I/O and Speed
2017-09-24	What Can We Do About Reward Hacking?: Concrete Problems in AI Safety Part 4
2017-08-29	Reward Hacking Reloaded: Concrete Problems in AI Safety Part 3.5
2017-08-22	The other "Killer Robot Arms Race" Elon Musk should worry about
2017-08-12	Reward Hacking: Concrete Problems in AI Safety Part 3
2017-07-22	Why Not Just: Raise AI Like Kids?
2017-07-09	Empowerment: Concrete Problems in AI Safety part 2
2017-06-25	Avoiding Positive Side Effects: Concrete Problems in AI Safety part 1.5
2017-06-18	Avoiding Negative Side Effects: Concrete Problems in AI Safety part 1
2017-06-17	Robert Miles Live Stream
2017-06-10	Are AI Risks like Nuclear Risks?
2017-05-27	Respectability

Tags:

AGI

Artificial Intelligence

AI risk

AI safety

robert miles

robert miles AI

concrete problems in ai safety

reward hacking

hacking

reenforcement learning

deepmind

machine learning

wireheading

Channel	Latest
GregzVR	6 hours ago
Flik's Gaming Stuff	6 hours ago
BestGamerHD	6 hours ago
RajmanGaming HD	6 hours ago
Mr. E	6 hours ago
WeJustPlayGames	6 hours ago
HDNEWS TECH future	6 hours ago
★WishingTikal★	6 hours ago
Rising-Jay	6 hours ago
tvgry	6 hours ago
Chris Spencer	6 hours ago
Tabor Hill	7 hours ago
OTF Cross	7 hours ago
Lorerunner	7 hours ago
PlayDose	7 hours ago
Sports Gaming Universe	7 hours ago
Zombr3x Music	7 hours ago
Alibabav8 Games	7 hours ago
Cultura VJ	7 hours ago
Darkchiken8	7 hours ago
David Mendez24	7 hours ago
Cane Sim Media	7 hours ago
AlternativeFire	7 hours ago
Pyres	7 hours ago
AdayCanarioWTF	7 hours ago