Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning | AISC

Channel:

LLMs Explained - Aggregate Intellect - AI.SCIENCE

Subscribers:

22,600

Published on August 13, 2020 4:47:41 AM ● Video Link: https://www.youtube.com/watch?v=OWnL8t0CF3Y

Duration: 43:08

283 views

Speaker(s): Mido Assran
Host: Susan Shu Chang

Find the recording, slides, and more info at https://ai.science/e/gossip-based-actor-learner-architectures-for-deep-reinforcement-learning--QSSRnnwOaTVF2ByuEnPk

Motivation / Abstract

Multi-simulator training has contributed to the recent success of Deep Reinforcement Learning by stabilizing learning and allowing for higher training throughputs. We propose Gossip-based Actor-Learner Architectures (GALA) where several actor-learners (such as A2C agents) are organized in a peer-to-peer communication topology, and exchange information through asynchronous gossip in order to take advantage of a large number of distributed simulators. We prove that GALA agents remain within an epsilon-ball of one-another during training when using loosely coupled asynchronous communication. By reducing the amount of synchronization between agents, GALA is more computationally efficient and scalable compared to A2C, its fully-synchronous counterpart. GALA also outperforms A2C, being more robust and sample efficient. We show that we can run several loosely coupled GALA agents in parallel on a single GPU and achieve significantly higher hardware utilization and frame-rates than vanilla A2C at comparable power draws.

------
#AISC hosts 3-5 live sessions like this on various AI research, engineering, and product topics every week! Visit https://ai.science for more details

Other Videos By LLMs Explained - Aggregate Intellect - AI.SCIENCE

2020-08-19	Discovering Symbolic Inductive Biases \| AISC
2020-08-19	Product Ideation - Art of Finding the Right Problem to Work on! \| AISC
2020-08-19	Pink Diamond - Data Driven Prediction of Venture Success \| Workshop Capstone
2020-08-19	Review Nuggets - Mining Insight from Consumer Product Reviews \| Workshop Capstone
2020-08-19	Fast Film - Emotionally Aware Movie Recommender \| Workshop Capstone
2020-08-19	Acetock - Stock Prediction Tool for Amateur Investors \| Workshop Capstone
2020-08-19	Saramsh - Patent Document Summarization using BART \| Workshop Capstone
2020-08-19	MindfulZen - Data Driven Stress Buster \| Workshop Capstone
2020-08-14	Machine Learning and the Earth: Applying AI to address some of the world’s greatest challenges
2020-08-13	Xun Wang (GEICO): 7 Job Profiles to Demystify the Data Science Career Landscape
2020-08-12	Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning \| AISC
2020-08-12	Computer v.s. Human visual system \| AISC
2020-08-12	AI Fariness and Adversarial Debiasing
2020-08-11	Joint Policy-Value Learning for Recommendation \| AISC
2020-08-11	Operationalizing the AI Canvas for AI Product Success (and profit) \| AISC
2020-08-07	Overview of Bias and Fairness in AI
2020-08-06	Subexponential-Time Algorithms for Sparse PCA \| AISC
2020-08-05	Inverse design of nanoporous crystalline reticular materials with deep generative models \| AISC
2020-08-04	ChemOS: An orchestration software to democratize autonomous discovery \| AISC
2020-07-30	Recurrent Neural Network for Quantum Wave Function \| AISC
2020-07-30	Bounded Rationality in Las Vegas: Probabilistic Finite Automata PlayMulti-Armed Bandits \| AISC

Channel	Latest
PopCross Studios	6 hours ago
RTGame	10 hours ago
Dawko	11 hours ago
MKIceAndFire	11 hours ago
IntroGameOver	11 hours ago
alanzoka	12 hours ago
oGVexx	13 hours ago
CarbotAnimations	13 hours ago
Icehiteru	15 hours ago
raocow	16 hours ago
Grimith	17 hours ago
Caner Akçay	18 hours ago
whitemoca	19 hours ago
LevelUp Legends	19 hours ago
mariey tv	19 hours ago
Yuichiro Gaming	19 hours ago
RIJEKKK	19 hours ago
상상상상	19 hours ago
69SportTV	19 hours ago
Fandy DS	19 hours ago
SAEROS ID	19 hours ago
Electronics Repair School	19 hours ago
Fuukoji	19 hours ago
PLAYzone	19 hours ago
Pyken	19 hours ago