The video shows agents trained using the Asynchronous Advantage Actor-Critic (A3C) algorithm performing a variety of motor control tasks. The tasks successfully learned by the agents include pole swing-up, quadruped locomotion, planar biped walking, balancing, 2D target reaching, and 3D manipulation. Paper link –
source
Asynchronous Methods for Deep Reinforcement Learning: MuJoCo
Previous ArticleOpenAI Spinning Up in Deep RL Workshop