Asynchronous Methods for Deep Reinforcement Learning: Labyrinth

June 14, 2016

5 views

1 min read

Cinema Mode

The video shows an agent collecting rewards in previously unseen mazes using only raw pixels as input. The agent was trained using the Asynchronous Advantage Actor-Critic (A3C) algorithm and was only rewarded for picking up apples and orange portals during training.
Paper link – http://arxiv.org/pdf/1602.01783.pdf

Asynchronous Methods for Deep Reinforcement Learning: Labyrinth

Add comment

Cancel reply

Categories

All Topics

210,000 CODERS lost jobs as NVIDIA released NEW coding language.

Kurzweil: AI will be smarter than all humans combined by 2029

The AI Revolution: Will Robots Take Your Job?

Artificial Intelligence | 60 Minutes Full Episodes

The A.I. Dilemma – March 9, 2023

In the Age of AI (full documentary) | FRONTLINE

Asynchronous Methods for Deep Reinforcement Learning: Labyrinth

You may also like

Add comment

Categories

All Topics