Here the player can either go for a quick reward (apples) or can take a small negative reward (lemon) in order to get the highly prized melon!

In environment footage, captured via human player.

Check out the paper for more details https://deepmind.com/documents/deepmind_lab.pdf and our github repo to use the environments yourself https://github.com/deepmind/lab

Add comment

Your email address will not be published. Required fields are marked *

Categories

All Topics