Asynchronous Methods for Deep Reinforcement Learning: TORCS

Subscribers:
663,000
Published on ● Video Link: https://www.youtube.com/watch?v=0xo1Ldx3L5Q



Duration: 0:30
46,998 views
335


The video shows an agent driving a racecar using only raw pixels as input. The agent was trained using the Asynchronous Advantage Actor-Critic (A3C) algorithm. During training, the agent was rewarded for maintaining high velocity along the center of the racetrack.
Paper link - http://arxiv.org/pdf/1602.01783.pdf