OpenAI Gym
Nav
  • Home
  • Environments
  • Documentation
  • Forum
  • Close
  • Sign in with GitHub
Copy-v0

algorithm on Copy-v0

  • Writeup
  • 2016-08-30 02:59:28.384305
tilarids

Learning performance

Solved after 13937 episodes. Best 100-episode average reward was 31.20 ± 0.08. (Copy-v0 is considered "solved" when the agent obtains an average reward of at least 25.0 over 100 consecutive episodes.)

13937
Episodes to solve
18324
Total episodes
Solved
33s
Time to solve
Download
Tweet
How to reproduce
Comments

Comment on GitHub

Comment on GitHub

  • Environments
  • Documentation
  • Forum
  • Credits
OpenAI