OpenAI Gym
Nav
  • Home
  • Environments
  • Documentation
  • Forum
  • Close
  • Sign in with GitHub
CartPole-v0

algorithm on CartPole-v0

  • 2017-09-09 08:04:35.381597
sonic1sonic

Learning performance

Solved after 185 episodes. Best 100-episode average reward was 200.00 ± 0.00. (CartPole-v0 is considered "solved" when the agent obtains an average reward of at least 195.0 over 100 consecutive episodes.)

185
Episodes to solve
500
Total episodes
Solved
4s
Time to solve
Download
Tweet
  • Environments
  • Documentation
  • Forum
  • Credits
OpenAI