Advertisement
Guest User

Untitled

a guest
Feb 24th, 2017
65
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 0.44 KB | None | 0 0
  1. Model free Q-Learning in an MDP style environment
  2.  
  3. Utilized code from Berkeley's CS188 Reinforcement Learning project
  4.  
  5. Introduced an epsilon decay to offer a transition between early exploration and late exploitation
  6.  
  7. QLearning paramters:
  8.  
  9. alpha = 0.1
  10. epsilon = 1.0
  11. gamma = .99
  12. epsilon_decay = .9995
  13. learning_decay = 1.0
  14.  
  15.  
  16. Program paramters:
  17. python openai.py -v FrozenLake-v0
  18. -a 0.1
  19. -e 1.0
  20. -g .99
  21. --learningDecay 1.0
  22. --explorationDecay .9995
  23. -x 10000
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement