OpenAI Gym
CartPole-v0

algorithm on CartPole-v0

hzwer, 2017-09-10 15:51:02.713221

Learning performance

Solved after 1894 episodes. Best 100-episode average reward was 197.32 ± 0.79. (CartPole-v0 is considered "solved" when the agent obtains an average reward of at least 195.0 over 100 consecutive episodes.)
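
The solving criterion above can be checked against a list of per-episode rewards. The following is a minimal sketch, not the scoring code used by the site: the function name `episodes_to_solve` is hypothetical, and it assumes the episode count is reported at the end of the first 100-episode window whose mean reaches 195.0 (the page does not say whether the count refers to the start or the end of that window).

```python
from collections import deque
from typing import Iterable, Optional

SOLVED_THRESHOLD = 195.0  # CartPole-v0 threshold quoted in the text above
WINDOW = 100              # number of consecutive episodes averaged


def episodes_to_solve(rewards: Iterable[float],
                      threshold: float = SOLVED_THRESHOLD,
                      window: int = WINDOW) -> Optional[int]:
    """Return the 1-based episode at which the trailing `window`-episode
    mean first reaches `threshold`, or None if it never does.

    Assumption: the count is taken at the END of the qualifying window.
    """
    recent = deque(maxlen=window)  # rewards of the most recent episodes
    running_sum = 0.0
    for episode, reward in enumerate(rewards, start=1):
        if len(recent) == window:
            # The oldest reward is about to be evicted by append().
            running_sum -= recent[0]
        recent.append(reward)
        running_sum += reward
        if len(recent) == window and running_sum / window >= threshold:
            return episode
    return None


if __name__ == "__main__":
    # Synthetic rewards for illustration only, not data from this run.
    synthetic = [100.0] * 50 + [200.0] * 200
    print(episodes_to_solve(synthetic))
```

Keeping a running sum avoids re-summing the window on every episode, which matters little for 2000 episodes but keeps the check linear in the number of episodes.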

Episodes to solve: 1894
Total episodes: 2000
Status: Solved
Time to solve: 9m