Assessing Game Balance with AlphaZero: Exploring Alternative Rule Sets in Chess (Paper Explained)

Channel:

Yannic Kilcher

Subscribers:

300,000

Published on September 13, 2020 7:56:31 PM ● Video Link: https://www.youtube.com/watch?v=O1b0cbgpRBw

Category:

Let's Play

Duration: 42:41

5,966 views

199

#ai #chess #alphazero

Chess is a very old game and both its rules and theory have evolved over thousands of years in the collective effort of millions of humans. Therefore, it is almost impossible to predict the effect of even minor changes to the game rules, because this collective process cannot be easily replicated. This paper proposes to use AlphaZero's ability to achieve superhuman performance in board games within one day of training to assess the effect of a series of small, but consequential rule changes. It analyzes the resulting strategies and sets the stage for broader applications of reinforcement learning to study rule-based systems.

OUTLINE:
0:00 - Intro & Overview
2:30 - Alternate Chess Rules
4:20 - Using AlphaZero to assess rule change outcomes
6:00 - How AlphaZero works
16:40 - Alternate Chess Rules continued
18:50 - Game outcome distributions
31:45 - e4 and Nf3 in classic vs no-castling chess
36:40 - Conclusions & comments

Paper: https://arxiv.org/abs/2009.04374

My Video on AI Economist: https://youtu.be/F5aaXrIMWyU

Abstract:
It is non-trivial to design engaging and balanced sets of game rules. Modern chess has evolved over centuries, but without a similar recourse to history, the consequences of rule changes to game dynamics are difficult to predict. AlphaZero provides an alternative in silico means of game balance assessment. It is a system that can learn near-optimal strategies for any rule set from scratch, without any human supervision, by continually learning from its own experience. In this study we use AlphaZero to creatively explore and design new chess variants. There is growing interest in chess variants like Fischer Random Chess, because of classical chess's voluminous opening theory, the high percentage of draws in professional play, and the non-negligible number of games that end while both players are still in their home preparation. We compare nine other variants that involve atomic changes to the rules of chess. The changes allow for novel strategic and tactical patterns to emerge, while keeping the games close to the original. By learning near-optimal strategies for each variant with AlphaZero, we determine what games between strong human players might look like if these variants were adopted. Qualitatively, several variants are very dynamic. An analytic comparison show that pieces are valued differently between variants, and that some variants are more decisive than classical chess. Our findings demonstrate the rich possibilities that lie beyond the rules of modern chess.

Authors: Nenad Tomašev, Ulrich Paquet, Demis Hassabis, Vladimir Kramnik

Links:
YouTube: https://www.youtube.com/c/yannickilcher
Twitter: https://twitter.com/ykilcher
Discord: https://discord.gg/4H8xxDF
BitChute: https://www.bitchute.com/channel/yannic-kilcher
Minds: https://www.minds.com/ykilcher
Parler: https://parler.com/profile/YannicKilcher
LinkedIn: https://www.linkedin.com/in/yannic-kilcher-488534136/

If you want to support me, the best thing to do is to share out the content :)

If you want to support me financially (completely optional and voluntary, but a lot of people have asked for this):
SubscribeStar: https://www.subscribestar.com/yannickilcher
Patreon: https://www.patreon.com/yannickilcher
Bitcoin (BTC): bc1q49lsw3q325tr58ygf8sudx2dqfguclvngvy2cq
Ethereum (ETH): 0x7ad3513E3B8f66799f507Aa7874b1B0eBC7F85e2
Litecoin (LTC): LQW2TRyKYetVC8WjFkhpPhtpbDM4Vw7r9m
Monero (XMR): 4ACL8AGrEo5hAir8A9CeVrW8pEauWvnp1WnSDZxW7tziCDLhZAGsgzhRQABDnFy8yuM9fWJDviJPHKRjV4FWt19CJZN9D4n

Other Videos By Yannic Kilcher

2020-11-22	Fourier Neural Operator for Parametric Partial Differential Equations (Paper Explained)
2020-11-15	[News] Soccer AI FAILS and mixes up ball and referee's bald head.
2020-11-10	Underspecification Presents Challenges for Credibility in Modern Machine Learning (Paper Explained)
2020-11-02	Language Models are Open Knowledge Graphs (Paper Explained)
2020-10-26	Rethinking Attention with Performers (Paper Explained)
2020-10-17	LambdaNetworks: Modeling long-range Interactions without Attention (Paper Explained)
2020-10-11	Descending through a Crowded Valley -- Benchmarking Deep Learning Optimizers (Paper Explained)
2020-10-04	An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale (Paper Explained)
2020-10-03	Training more effective learned optimizers, and using them to train themselves (Paper Explained)
2020-09-18	The Hardware Lottery (Paper Explained)
2020-09-13	Assessing Game Balance with AlphaZero: Exploring Alternative Rule Sets in Chess (Paper Explained)
2020-09-07	Learning to summarize from human feedback (Paper Explained)
2020-09-02	Self-classifying MNIST Digits (Paper Explained)
2020-08-28	Axial-DeepLab: Stand-Alone Axial-Attention for Panoptic Segmentation (Paper Explained)
2020-08-26	Radioactive data: tracing through training (Paper Explained)
2020-08-23	Fast reinforcement learning with generalized policy updates (Paper Explained)
2020-08-20	What Matters In On-Policy Reinforcement Learning? A Large-Scale Empirical Study (Paper Explained)
2020-08-18	[Rant] REVIEWER #2: How Peer Review is FAILING in Machine Learning
2020-08-14	REALM: Retrieval-Augmented Language Model Pre-Training (Paper Explained)
2020-08-12	Meta-Learning through Hebbian Plasticity in Random Networks (Paper Explained)
2020-08-09	Hopfield Networks is All You Need (Paper Explained)

Tags:

deep learning

machine learning

arxiv

explained

neural networks

artificial intelligence

paper

deepmind

chess

kramnik

fide

rules

alphago

alpha go

alphazero

alpha zero

mu zero

muzero

google

reinforcement learning

mcts

rule change

other rules

alternate rules

torpedo

no castling

pawn sideways

self capture

entropy

opening theory

rule based systems

berlin defense

opening

stalemate

deep rl

deep reinforcement learning

alphazero chess

alphazero analysis

Channel	Latest
MrT-Gaming	7 hours ago
The Nishant Vibe	7 hours ago
atv	7 hours ago
TerraChannel / TerraFox	7 hours ago
LukePingu	7 hours ago
Taffe316	7 hours ago
RapCheck	7 hours ago
SOLO GAMER	7 hours ago
Olympus	8 hours ago
Gellar Gaiden	8 hours ago
JÚNIOR GAELZIN	8 hours ago
DIOSTAR GAMER	8 hours ago
RUTAX FREESTYLE	8 hours ago
Loster99	8 hours ago
NS_ART	8 hours ago
Power Art YT	8 hours ago
iin indra wicahya	8 hours ago
TechBag	8 hours ago
milkcat 밀캣 (밀크캣)	8 hours ago
imjinxss	8 hours ago
Gauging Gadgets	8 hours ago
Sonic Plasma	8 hours ago
JSChels	8 hours ago
Boom Logo Effects	8 hours ago
DIGITAL UNDERGROUND GAMING	8 hours ago