OpenAI Gym
Nav
  • Home
  • Environments
  • Documentation
  • Forum
  • Close
  • Sign in with GitHub
CartPole-v0

algorithm on CartPole-v0

  • 2017-09-10 17:36:40.197322
haron1100

Learning performance

Solved after 88 episodes. Best 100-episode average reward was 199.33 ± 0.29. (CartPole-v0 is considered "solved" when the agent obtains an average reward of at least 195.0 over 100 consecutive episodes.)

88
Episodes to solve
240
Total episodes
Solved
2s
Time to solve
Download
Tweet
  • Environments
  • Documentation
  • Forum
  • Credits
OpenAI