OpenAI Gym
Nav
  • Home
  • Environments
  • Documentation
  • Forum
  • Close
  • Sign in with GitHub
Centipede-ram-v0

algorithm on Centipede-ram-v0

  • Writeup
  • 2016-04-25 06:46:03.972296
joschu

Learning performance

Best 100-episode average reward was 6099.53 ± 376.53. (Centipede-ram-v0 does not have a specified reward threshold at which it's considered solved.)

6099.53 ± 376.53
Score
3h
Total runtime
Download
Tweet

Algorithm

This evaluation was generated by running trpo-gae-v0.

How to reproduce
Comments

Comment on GitHub

Comment on GitHub

  • Environments
  • Documentation
  • Forum
  • Credits
OpenAI