Untitled

a guest
Nov 14th, 2019
- model: cartpole-ann
  description: Train a simple ANN on the base cartpole environment
  operations:
    train:
      description: Train ANN model
      main: snnrl/algos/vpg/vpg --save_dir data
      output-scalars: no
      flags-import-skip: [save_dir]
      flags:
        device:
          default: cuda
        gym_env:
          default: "ImageCartPole-v0"
        epochs:
          default: 50
        steps_per_epoch:
          default: 4000
        max_ep_length:
          default: 1000
        policy_lr:
          default: 3e-4
        vf_lr:
          default: 1e-3
        vf_iters:
          default: 400
        gae_gamma:
          default: 0.99
        gae_lam:
          default: 0.95
        policy_hidden:
          default: "32,64"
        vf_hidden:
          default: "32,64"
      compare:
        - ep_ret_mean/train as ep_ret_mean
        - ep_ret_var/train as ep_ret_var
        - ep_len_mean/train as ep_len_mean
        - ep_len_var/train as ep_len_var
    image-diff:
      description:
        Train the same model, but using the diff environment and hope
        it is better.
      steps:
        - run: train gym_env=ImageDiffCartPole-v0

- model: cartpole-snn
  description: Train a simple SNN on the image cartpole environment
  operations:
    train:
      description: Train SNN model
      main: snnrl/algos/vpg/snn_vpg --save_dir data
      output-scalars: no
      flags-import-skip: [save_dir]
      flags:
        device:
          default: cuda
        gym_env:
          default: "ImageCartPole-v0"
        epochs:
          default: 50
        steps_per_epoch:
          default: 4000
        max_ep_length:
          default: 1000
        policy_lr:
          default: 3e-4
        vf_lr:
          default: 1e-3
        vf_iters:
          default: 400
        gae_gamma:
          default: 0.99
        gae_lam:
          default: 0.95
        snn_config:
          default: "./configs/snn.json"
      requires:
        - file: snnrl/configs
          path: .
      compare:
        - ep_ret_mean/train as ep_ret_mean
        - ep_ret_var/train as ep_ret_var
        - ep_len_mean/train as ep_len_mean
        - ep_len_var/train as ep_len_var