OpenAI Gym
Nav
  • Home
  • Environments
  • Documentation
  • Forum
  • Close
  • Sign in with GitHub
Go9x9-v0

algorithm on Go9x9-v0

  • 2016-12-29 07:41:38.765825
henrykmichalewski

Learning performance

Best 100-episode average reward was 0.99 ± 0.01. (Go9x9-v0 does not have a specified reward threshold at which it's considered solved.)

0.99 ± 0.01
Score
2m
Total runtime
Download
Tweet
  • Environments
  • Documentation
  • Forum
  • Credits
OpenAI