Untitled

a guest
Apr 23rd, 2017
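import gym


def main():
    env = gym.make('CartPole-v1')
    # Reset first so the environment has a state to render.
    obs = env.reset()
    env.render()
    reward = 0
    while True:
        # Classic gym API: step() returns (obs, reward, done, info).
        obs, rew, done, _ = env.step(policy(obs))
        env.render()
        reward += rew
        if done:
            break
    env.close()
    print('Total reward of %f' % reward)


def policy(obs):
    # obs[2] is the pole angle, obs[3] the pole angular velocity.
    # Push left (0) if the pole leans or rotates left past the thresholds,
    # otherwise push right (1).
    if obs[2] < -0.012419 or obs[3] < -0.091290:
        return 0
    return 1


main()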
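
The threshold rule in policy() can be checked without gym installed. The following is only a minimal sketch: it assumes the usual CartPole observation layout (cart position, cart velocity, pole angle, pole angular velocity) and the usual action coding (0 pushes left, 1 pushes right). The names LEFT, RIGHT, samples and actions, and the sample observations themselves, are made up for illustration and are not part of the paste.

```python
# Assumed CartPole conventions (not stated in the paste):
#   obs = [cart position, cart velocity, pole angle, pole angular velocity]
#   action 0 pushes the cart left, action 1 pushes it right
LEFT, RIGHT = 0, 1


def policy(obs):
    """Same thresholds as the paste: go left if the pole leans or rotates left."""
    if obs[2] < -0.012419 or obs[3] < -0.091290:
        return LEFT
    return RIGHT


# Hand-made observations (hypothetical values, chosen to hit each branch).
samples = {
    "upright, at rest": [0.0, 0.0, 0.0, 0.0],
    "leaning left": [0.0, 0.0, -0.05, 0.0],
    "rotating left": [0.0, 0.0, 0.0, -0.5],
    "leaning right": [0.0, 0.0, 0.05, 0.2],
}

actions = {name: policy(obs) for name, obs in samples.items()}

for name, action in actions.items():
    print("%s: push %s" % (name, "left" if action == LEFT else "right"))
```

Both comparisons are strict, so an observation sitting exactly on a threshold falls through to the push-right branch.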