OpenAI Gym
Nav
  • Home
  • Environments
  • Documentation
  • Forum
  • Close
  • Sign in with GitHub
CartPole-v0

algorithm on CartPole-v0

  • Writeup
  • 2017-09-11 06:38:13.848282
mbalunovic

Learning performance

Solved after 306 episodes. Best 100-episode average reward was 200.00 ± 0.00. (CartPole-v0 is considered "solved" when the agent obtains an average reward of at least 195.0 over 100 consecutive episodes.)

306
Episodes to solve
1001
Total episodes
Solved
12m
Time to solve
Download
Tweet
How to reproduce
Comments

Comment on GitHub

Comment on GitHub

  • Environments
  • Documentation
  • Forum
  • Credits
OpenAI