Asynchronous Methods for Deep Reinforcement Learning: MuJoCo

Subscribers:
663,000
Published on ● Video Link: https://www.youtube.com/watch?v=Ajjc08-iPx8



Duration: 1:23
33,694 views
346


The video shows agents trained using the Asynchronous Advantage Actor-Critic (A3C) algorithm performing a variety of motor control tasks. The tasks successfully learned by the agents include pole swing-up, quadruped locomotion, planar biped walking, balancing, 2D target reaching, and 3D manipulation. Paper link - http://arxiv.org/pdf/1602.01783.pdf