OpenAI Gym
CartPole-v0

algorithm on CartPole-v0

  • 2017-09-10 03:20:07.066602
wxl18039675170

Learning performance

Did not solve the environment. Best 100-episode average reward was 22.82 ± 1.26. (CartPole-v0 is considered "solved" when the agent obtains an average reward of at least 195.0 over 100 consecutive episodes.)
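
The "solved" criterion above can be checked against a list of per-episode rewards. The following is a minimal sketch, not the code Gym used for scoring: the function names `best_window_average` and `is_solved` are hypothetical, and only the 195.0 threshold and the 100-episode window come from the text above. It does not reproduce the ± figure, since the page does not say how that was computed.

```python
# Sketch of the "solved" check described above.
# Threshold and window come from the page text; the function names are
# hypothetical and chosen only for this illustration.

SOLVED_THRESHOLD = 195.0  # minimum average reward for CartPole-v0
WINDOW = 100              # number of consecutive episodes averaged


def best_window_average(episode_rewards, window=WINDOW):
    """Return the highest mean over any `window` consecutive episodes.

    Returns None when fewer than `window` episodes are available.
    """
    if len(episode_rewards) < window:
        return None
    # Sliding-window sum: add the newest episode, drop the oldest.
    running = sum(episode_rewards[:window])
    best = running
    for i in range(window, len(episode_rewards)):
        running += episode_rewards[i] - episode_rewards[i - window]
        best = max(best, running)
    return best / window


def is_solved(episode_rewards, threshold=SOLVED_THRESHOLD, window=WINDOW):
    """True if some run of `window` consecutive episodes averages >= threshold."""
    best = best_window_average(episode_rewards, window)
    return best is not None and best >= threshold
```

Under this sketch, a run whose best 100-episode average is about 22.82, as reported here, falls well short of the 195.0 threshold, so `is_solved` would return `False`.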

Average reward: 22.82 ± 1.26 (unsolved)
Total runtime: 3s