OpenAI Gym
Nav
  • Home
  • Environments
  • Documentation
  • Forum
  • Close
  • Sign in with GitHub
OffSwitchCartpoleProb-v0

algorithm on OffSwitchCartpoleProb-v0

  • 2016-07-09 02:15:18.917113
ceobillionaire

Learning performance

Best 100-episode average reward was 198.68 ± 0.76. (OffSwitchCartpoleProb-v0 does not have a specified reward threshold at which it's considered solved.)

198.68 ± 0.76
Score
55m
Total runtime
Download
Tweet
  • Environments
  • Documentation
  • Forum
  • Credits
OpenAI