Guest User

Untitled

a guest
Apr 9th, 2021
38
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
  1. nohup: ignoring input
  2. [I 2021-04-06 22:02:35,181] A new study created in memory with name: optuna
  3. 2021-04-06 22:02:35,188 INFO worker.py:654 -- Connecting to existing Ray cluster at address: 10.128.0.18:6379
  4. 2021-04-06 22:02:35,241 WARNING function_runner.py:545 -- Function checkpointing is disabled. This may result in unexpected behavior when using checkpointing features or certain schedulers. To enable, set the train function arguments to be `func(config, checkpoint_dir=None)`.
  5. == Status ==
  6. Memory usage on this node: 1.2/22.1 GiB
  7. Using FIFO scheduling algorithm.
  8. Resources requested: 0/6 CPUs, 0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  9. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  10. Number of trials: 1/100 (1 PENDING)
  11.  
  12.  
  13. Trial train_cb329ede reported mean_reward=-131.23800454200708 with parameters={'n_epochs': 28, 'gamma': 0.9208008749298612, 'ent_coef': 0.07838595977369499, 'learning_rate': 0.00024377717754597446, 'vf_coef': 0.4063916805295914, 'gae_lambda': 0.9080348648918256, 'max_grad_norm': 3.3720863271450323, 'n_steps': 128, 'batch_size': 64, 'n_envs': 4, 'clip_range': 4.4916679570111}.
  14. == Status ==
  15. Memory usage on this node: 3.7/22.1 GiB
  16. Using FIFO scheduling algorithm.
  17. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  18. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  19. Number of trials: 1/100 (1 RUNNING)
  20.  
  21.  
  22. Trial train_cb329ede completed. Last result: mean_reward=-131.23800454200708
  23. == Status ==
  24. Memory usage on this node: 3.5/22.1 GiB
  25. Using FIFO scheduling algorithm.
  26. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  27. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  28. Number of trials: 2/100 (1 RUNNING, 1 TERMINATED)
  29.  
  30.  
  31. Trial train_587a9c0e reported mean_reward=-747.1447419311 with parameters={'n_epochs': 10, 'gamma': 0.9409558869890438, 'ent_coef': 0.09354804296077848, 'learning_rate': 3.566834690355431e-05, 'vf_coef': 0.31884445314719273, 'gae_lambda': 0.9066677687619378, 'max_grad_norm': 0.44119894428950707, 'n_steps': 2048, 'batch_size': 64, 'n_envs': 8, 'clip_range': 1.5987089847642335}.
  32. == Status ==
  33. Memory usage on this node: 3.7/22.1 GiB
  34. Using FIFO scheduling algorithm.
  35. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  36. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  37. Number of trials: 2/100 (1 RUNNING, 1 TERMINATED)
  38.  
  39.  
  40. Trial train_587a9c0e completed. Last result: mean_reward=-747.1447419311
  41. == Status ==
  42. Memory usage on this node: 3.5/22.1 GiB
  43. Using FIFO scheduling algorithm.
  44. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  45. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  46. Number of trials: 3/100 (1 RUNNING, 2 TERMINATED)
  47.  
  48.  
  49. Trial train_2467d8e0 reported mean_reward=-575.1108521857764 with parameters={'n_epochs': 19, 'gamma': 0.9927378898574789, 'ent_coef': 0.0029549299244529797, 'learning_rate': 4.6406339680686644e-05, 'vf_coef': 0.9734658735035873, 'gae_lambda': 0.8029049212061022, 'max_grad_norm': 0.022285111579608642, 'n_steps': 4096, 'batch_size': 32, 'n_envs': 2, 'clip_range': 1.1308224052869966}.
  50. == Status ==
  51. Memory usage on this node: 3.8/22.1 GiB
  52. Using FIFO scheduling algorithm.
  53. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  54. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  55. Number of trials: 3/100 (1 RUNNING, 2 TERMINATED)
  56.  
  57.  
  58. Trial train_2467d8e0 completed. Last result: mean_reward=-575.1108521857764
  59. == Status ==
  60. Memory usage on this node: 3.6/22.1 GiB
  61. Using FIFO scheduling algorithm.
  62. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  63. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  64. Number of trials: 4/100 (1 RUNNING, 3 TERMINATED)
  65.  
  66.  
  67. Trial train_929317f6 reported mean_reward=-725.4147631028152 with parameters={'n_epochs': 5, 'gamma': 0.9026014570728472, 'ent_coef': 0.01185128893399963, 'learning_rate': 7.086151452253238e-05, 'vf_coef': 0.5025901486338282, 'gae_lambda': 0.9725369111603837, 'max_grad_norm': 1.9069490545616843, 'n_steps': 512, 'batch_size': 128, 'n_envs': 4, 'clip_range': 0.5710767833464744}.
  68. == Status ==
  69. Memory usage on this node: 3.8/22.1 GiB
  70. Using FIFO scheduling algorithm.
  71. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  72. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  73. Number of trials: 4/100 (1 RUNNING, 3 TERMINATED)
  74.  
  75.  
  76. Trial train_929317f6 completed. Last result: mean_reward=-725.4147631028152
  77. == Status ==
  78. Memory usage on this node: 3.6/22.1 GiB
  79. Using FIFO scheduling algorithm.
  80. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  81. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  82. Number of trials: 5/100 (1 RUNNING, 4 TERMINATED)
  83.  
  84.  
  85. Trial train_d62efdb4 reported mean_reward=-666.8855130077344 with parameters={'n_epochs': 29, 'gamma': 0.904328783503791, 'ent_coef': 0.0041722256942914315, 'learning_rate': 4.9200815567235365e-05, 'vf_coef': 0.8324089498584022, 'gae_lambda': 0.9619214943284222, 'max_grad_norm': 0.6483644166834951, 'n_steps': 4096, 'batch_size': 64, 'n_envs': 8, 'clip_range': 0.5255197235122465}.
  86. == Status ==
  87. Memory usage on this node: 3.8/22.1 GiB
  88. Using FIFO scheduling algorithm.
  89. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  90. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  91. Number of trials: 5/100 (1 RUNNING, 4 TERMINATED)
  92.  
  93.  
  94. Trial train_d62efdb4 completed. Last result: mean_reward=-666.8855130077344
  95. == Status ==
  96. Memory usage on this node: 3.5/22.1 GiB
  97. Using FIFO scheduling algorithm.
  98. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  99. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  100. Number of trials: 6/100 (1 RUNNING, 5 TERMINATED)
  101.  
  102.  
  103. Trial train_12450bd0 reported mean_reward=-515.8101825047949 with parameters={'n_epochs': 4, 'gamma': 0.9264480859953559, 'ent_coef': 0.09888414074175368, 'learning_rate': 0.00021353130691370085, 'vf_coef': 0.8190135747804081, 'gae_lambda': 0.987473853510914, 'max_grad_norm': 0.15330445760802, 'n_steps': 512, 'batch_size': 128, 'n_envs': 4, 'clip_range': 0.7822732380246952}.
  104. == Status ==
  105. Memory usage on this node: 3.9/22.1 GiB
  106. Using FIFO scheduling algorithm.
  107. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  108. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  109. Number of trials: 6/100 (1 RUNNING, 5 TERMINATED)
  110.  
  111.  
  112. Trial train_12450bd0 completed. Last result: mean_reward=-515.8101825047949
  113. == Status ==
  114. Memory usage on this node: 3.4/22.1 GiB
  115. Using FIFO scheduling algorithm.
  116. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  117. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  118. Number of trials: 7/100 (1 RUNNING, 6 TERMINATED)
  119.  
  120.  
  121. Trial train_4f8f4bac reported mean_reward=-367.40078629461095 with parameters={'n_epochs': 6, 'gamma': 0.9625505106830452, 'ent_coef': 0.08371541626299007, 'learning_rate': 2.2259750862440304e-05, 'vf_coef': 0.5919794293806709, 'gae_lambda': 0.8851230247038313, 'max_grad_norm': 0.14011078791160458, 'n_steps': 2048, 'batch_size': 256, 'n_envs': 4, 'clip_range': 1.702862127551212}.
  122. == Status ==
  123. Memory usage on this node: 3.8/22.1 GiB
  124. Using FIFO scheduling algorithm.
  125. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  126. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  127. Number of trials: 7/100 (1 RUNNING, 6 TERMINATED)
  128.  
  129.  
  130. Trial train_4f8f4bac completed. Last result: mean_reward=-367.40078629461095
  131. == Status ==
  132. Memory usage on this node: 3.4/22.1 GiB
  133. Using FIFO scheduling algorithm.
  134. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  135. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  136. Number of trials: 8/100 (1 RUNNING, 7 TERMINATED)
  137.  
  138.  
  139. Trial train_05f6eaa0 reported mean_reward=-557.8676138391057 with parameters={'n_epochs': 25, 'gamma': 0.9428404130957041, 'ent_coef': 0.006055945743195439, 'learning_rate': 4.1305210345635324e-05, 'vf_coef': 0.21865683016229348, 'gae_lambda': 0.8547134674500554, 'max_grad_norm': 2.2272110059180545, 'n_steps': 128, 'batch_size': 256, 'n_envs': 8, 'clip_range': 2.161152723526277}.
  140. == Status ==
  141. Memory usage on this node: 3.8/22.1 GiB
  142. Using FIFO scheduling algorithm.
  143. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  144. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  145. Number of trials: 8/100 (1 RUNNING, 7 TERMINATED)
  146.  
  147.  
  148. Trial train_05f6eaa0 completed. Last result: mean_reward=-557.8676138391057
  149. == Status ==
  150. Memory usage on this node: 3.4/22.1 GiB
  151. Using FIFO scheduling algorithm.
  152. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  153. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  154. Number of trials: 9/100 (1 RUNNING, 8 TERMINATED)
  155.  
  156.  
  157. Trial train_0d026a14 reported mean_reward=-353.973561682491 with parameters={'n_epochs': 30, 'gamma': 0.9853126801731658, 'ent_coef': 0.0015959034947565314, 'learning_rate': 5.725703098457397e-06, 'vf_coef': 0.14252884462824258, 'gae_lambda': 0.9594912915123677, 'max_grad_norm': 0.07635667656761054, 'n_steps': 256, 'batch_size': 32, 'n_envs': 8, 'clip_range': 2.743715763530861}.
  158. == Status ==
  159. Memory usage on this node: 3.8/22.1 GiB
  160. Using FIFO scheduling algorithm.
  161. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  162. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  163. Number of trials: 9/100 (1 RUNNING, 8 TERMINATED)
  164.  
  165.  
  166. Trial train_0d026a14 completed. Last result: mean_reward=-353.973561682491
  167. == Status ==
  168. Memory usage on this node: 3.4/22.1 GiB
  169. Using FIFO scheduling algorithm.
  170. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  171. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  172. Number of trials: 10/100 (1 RUNNING, 9 TERMINATED)
  173.  
  174.  
  175. Trial train_60cd5f5e reported mean_reward=-418.6660568737172 with parameters={'n_epochs': 26, 'gamma': 0.9170742548714172, 'ent_coef': 0.0066875363827742255, 'learning_rate': 2.0938036433995396e-05, 'vf_coef': 0.3222517430471752, 'gae_lambda': 0.925860242134432, 'max_grad_norm': 0.6038030857142938, 'n_steps': 128, 'batch_size': 64, 'n_envs': 2, 'clip_range': 4.042212950979437}.
  176. == Status ==
  177. Memory usage on this node: 3.8/22.1 GiB
  178. Using FIFO scheduling algorithm.
  179. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  180. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  181. Number of trials: 10/100 (1 RUNNING, 9 TERMINATED)
  182.  
  183.  
  184. Trial train_60cd5f5e completed. Last result: mean_reward=-418.6660568737172
  185. == Status ==
  186. Memory usage on this node: 3.4/22.1 GiB
  187. Using FIFO scheduling algorithm.
  188. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  189. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  190. Number of trials: 11/100 (1 RUNNING, 10 TERMINATED)
  191.  
  192.  
  193. Trial train_468a32d4 reported mean_reward=-136.98070570582593 with parameters={'n_epochs': 46, 'gamma': 0.9629811000737964, 'ent_coef': 0.027311997272207015, 'learning_rate': 0.0004115031634608157, 'vf_coef': 0.5405933912758265, 'gae_lambda': 0.8278340050933533, 'max_grad_norm': 8.777612093173182, 'n_steps': 1024, 'batch_size': 64, 'n_envs': 4, 'clip_range': 4.965771390598875}.
  194. == Status ==
  195. Memory usage on this node: 3.9/22.1 GiB
  196. Using FIFO scheduling algorithm.
  197. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  198. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  199. Number of trials: 11/100 (1 RUNNING, 10 TERMINATED)
  200.  
  201.  
  202. Trial train_468a32d4 completed. Last result: mean_reward=-136.98070570582593
  203. == Status ==
  204. Memory usage on this node: 3.6/22.1 GiB
  205. Using FIFO scheduling algorithm.
  206. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  207. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  208. Number of trials: 12/100 (1 RUNNING, 11 TERMINATED)
  209.  
  210.  
  211. Trial train_d28148de reported mean_reward=-523.316864668492 with parameters={'n_epochs': 45, 'gamma': 0.9663620261046, 'ent_coef': 0.026927000067760297, 'learning_rate': 0.00043626925339494287, 'vf_coef': 0.502262879731641, 'gae_lambda': 0.8029244778314745, 'max_grad_norm': 6.316077255521917, 'n_steps': 1024, 'batch_size': 64, 'n_envs': 4, 'clip_range': 4.999094414396029}.
  212. == Status ==
  213. Memory usage on this node: 3.9/22.1 GiB
  214. Using FIFO scheduling algorithm.
  215. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  216. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  217. Number of trials: 12/100 (1 RUNNING, 11 TERMINATED)
  218.  
  219.  
  220. Trial train_d28148de completed. Last result: mean_reward=-523.316864668492
  221. == Status ==
  222. Memory usage on this node: 3.3/22.1 GiB
  223. Using FIFO scheduling algorithm.
  224. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  225. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  226. Number of trials: 13/100 (1 RUNNING, 12 TERMINATED)
  227.  
  228.  
  229. Trial train_f65e2628 reported mean_reward=-129.4695349349375 with parameters={'n_epochs': 50, 'gamma': 0.9590342427515501, 'ent_coef': 0.028309263747675693, 'learning_rate': 0.00047207703999468183, 'vf_coef': 0.6388233590517254, 'gae_lambda': 0.8436750225600244, 'max_grad_norm': 9.008647671013073, 'n_steps': 1024, 'batch_size': 64, 'n_envs': 4, 'clip_range': 4.728036699543357}.
  230. == Status ==
  231. Memory usage on this node: 3.8/22.1 GiB
  232. Using FIFO scheduling algorithm.
  233. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  234. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  235. Number of trials: 13/100 (1 RUNNING, 12 TERMINATED)
  236.  
  237.  
  238. Trial train_f65e2628 completed. Last result: mean_reward=-129.4695349349375
  239. == Status ==
  240. Memory usage on this node: 3.6/22.1 GiB
  241. Using FIFO scheduling algorithm.
  242. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  243. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  244. Number of trials: 14/100 (1 RUNNING, 13 TERMINATED)
  245.  
  246.  
  247. Trial train_cc8ba158 reported mean_reward=-1306.7836980176983 with parameters={'n_epochs': 40, 'gamma': 0.9283325993414108, 'ent_coef': 0.03940899994950076, 'learning_rate': 0.0001649347924821429, 'vf_coef': 0.6876966801913331, 'gae_lambda': 0.862118702812846, 'max_grad_norm': 3.6530161845355695, 'n_steps': 1024, 'batch_size': 64, 'n_envs': 4, 'clip_range': 3.928949949033421}.
  248. == Status ==
  249. Memory usage on this node: 3.9/22.1 GiB
  250. Using FIFO scheduling algorithm.
  251. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  252. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  253. Number of trials: 14/100 (1 RUNNING, 13 TERMINATED)
  254.  
  255.  
  256. Trial train_cc8ba158 completed. Last result: mean_reward=-1306.7836980176983
  257. == Status ==
  258. Memory usage on this node: 3.6/22.1 GiB
  259. Using FIFO scheduling algorithm.
  260. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  261. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  262. Number of trials: 15/100 (1 RUNNING, 14 TERMINATED)
  263.  
  264.  
  265. Trial train_a32a66e6 reported mean_reward=-131.21861467633474 with parameters={'n_epochs': 40, 'gamma': 0.9744751082393622, 'ent_coef': 0.056704691945807725, 'learning_rate': 0.00019449118368446654, 'vf_coef': 0.3757978012627515, 'gae_lambda': 0.9272263038583353, 'max_grad_norm': 8.607097833696697, 'n_steps': 128, 'batch_size': 64, 'n_envs': 4, 'clip_range': 4.062748253305418}.
  266. == Status ==
  267. Memory usage on this node: 3.9/22.1 GiB
  268. Using FIFO scheduling algorithm.
  269. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  270. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  271. Number of trials: 15/100 (1 RUNNING, 14 TERMINATED)
  272.  
  273.  
  274. Trial train_a32a66e6 completed. Last result: mean_reward=-131.21861467633474
  275. == Status ==
  276. Memory usage on this node: 3.6/22.1 GiB
  277. Using FIFO scheduling algorithm.
  278. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  279. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  280. Number of trials: 16/100 (1 RUNNING, 15 TERMINATED)
  281.  
  282.  
  283. Trial train_9dc6abda reported mean_reward=-576.6606970102672 with parameters={'n_epochs': 38, 'gamma': 0.9778318433329929, 'ent_coef': 0.01583639524093322, 'learning_rate': 0.00010891966607336705, 'vf_coef': 0.6661647766886074, 'gae_lambda': 0.9380584763832434, 'max_grad_norm': 9.874016303630176, 'n_steps': 256, 'batch_size': 64, 'n_envs': 4, 'clip_range': 3.2519434580767843}.
  284. == Status ==
  285. Memory usage on this node: 3.9/22.1 GiB
  286. Using FIFO scheduling algorithm.
  287. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  288. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  289. Number of trials: 16/100 (1 RUNNING, 15 TERMINATED)
  290.  
  291.  
  292. Trial train_9dc6abda completed. Last result: mean_reward=-576.6606970102672
  293. == Status ==
  294. Memory usage on this node: 3.3/22.1 GiB
  295. Using FIFO scheduling algorithm.
  296. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  297. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  298. Number of trials: 17/100 (1 RUNNING, 16 TERMINATED)
  299.  
  300.  
  301. Trial train_18a1ed0e reported mean_reward=-124.94211826171923 with parameters={'n_epochs': 50, 'gamma': 0.9989847047078314, 'ent_coef': 0.04800913827756165, 'learning_rate': 0.0004902849988308048, 'vf_coef': 0.4121798462876674, 'gae_lambda': 0.8719406578825957, 'max_grad_norm': 1.0298862201407044, 'n_steps': 128, 'batch_size': 64, 'n_envs': 2, 'clip_range': 3.528424586912717}.
  302. == Status ==
  303. Memory usage on this node: 3.8/22.1 GiB
  304. Using FIFO scheduling algorithm.
  305. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  306. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  307. Number of trials: 17/100 (1 RUNNING, 16 TERMINATED)
  308.  
  309.  
  310. Trial train_18a1ed0e completed. Last result: mean_reward=-124.94211826171923
  311. == Status ==
  312. Memory usage on this node: 3.6/22.1 GiB
  313. Using FIFO scheduling algorithm.
  314. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  315. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  316. Number of trials: 18/100 (1 RUNNING, 17 TERMINATED)
  317.  
  318.  
  319. Trial train_53ac1436 reported mean_reward=-130.7688301089932 with parameters={'n_epochs': 50, 'gamma': 0.9942527657352764, 'ent_coef': 0.021232492043020787, 'learning_rate': 0.0004778031919789803, 'vf_coef': 0.7711064359329675, 'gae_lambda': 0.8384972623663133, 'max_grad_norm': 0.9535358681684816, 'n_steps': 1024, 'batch_size': 32, 'n_envs': 2, 'clip_range': 3.2537549730930304}.
  320. == Status ==
  321. Memory usage on this node: 3.9/22.1 GiB
  322. Using FIFO scheduling algorithm.
  323. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  324. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  325. Number of trials: 18/100 (1 RUNNING, 17 TERMINATED)
  326.  
  327.  
  328. Trial train_53ac1436 completed. Last result: mean_reward=-130.7688301089932
  329. == Status ==
  330. Memory usage on this node: 3.6/22.1 GiB
  331. Using FIFO scheduling algorithm.
  332. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  333. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  334. Number of trials: 19/100 (1 RUNNING, 18 TERMINATED)
  335.  
  336.  
  337. Trial train_9cb5b8d0 reported mean_reward=-668.8659370243561 with parameters={'n_epochs': 48, 'gamma': 0.9516761496092414, 'ent_coef': 0.042273852466731106, 'learning_rate': 6.653707760614514e-06, 'vf_coef': 0.9639489273850157, 'gae_lambda': 0.877809891197249, 'max_grad_norm': 0.023520227397575245, 'n_steps': 128, 'batch_size': 128, 'n_envs': 2, 'clip_range': 3.2712656819696693}.
  338. == Status ==
  339. Memory usage on this node: 3.9/22.1 GiB
  340. Using FIFO scheduling algorithm.
  341. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  342. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  343. Number of trials: 19/100 (1 RUNNING, 18 TERMINATED)
  344.  
  345.  
  346. Trial train_9cb5b8d0 completed. Last result: mean_reward=-668.8659370243561
  347. == Status ==
  348. Memory usage on this node: 3.6/22.1 GiB
  349. Using FIFO scheduling algorithm.
  350. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  351. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  352. Number of trials: 20/100 (1 RUNNING, 19 TERMINATED)
  353.  
  354.  
  355. Trial train_e0e0392c reported mean_reward=-860.4115850948813 with parameters={'n_epochs': 36, 'gamma': 0.9524583212847471, 'ent_coef': 0.04864675402341336, 'learning_rate': 0.00031397879001347653, 'vf_coef': 0.4401889443344522, 'gae_lambda': 0.8257528097265099, 'max_grad_norm': 1.242692172556195, 'n_steps': 1024, 'batch_size': 256, 'n_envs': 2, 'clip_range': 4.576516806983119}.
  356. == Status ==
  357. Memory usage on this node: 3.9/22.1 GiB
  358. Using FIFO scheduling algorithm.
  359. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  360. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  361. Number of trials: 20/100 (1 RUNNING, 19 TERMINATED)
  362.  
  363.  
  364. Trial train_e0e0392c completed. Last result: mean_reward=-860.4115850948813
  365. == Status ==
  366. Memory usage on this node: 3.6/22.1 GiB
  367. Using FIFO scheduling algorithm.
  368. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  369. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  370. Number of trials: 21/100 (1 RUNNING, 20 TERMINATED)
  371.  
  372.  
  373. Trial train_076c547e reported mean_reward=-543.0008129155872 with parameters={'n_epochs': 50, 'gamma': 0.98329575524038, 'ent_coef': 0.014719623141605522, 'learning_rate': 0.00011620204489079275, 'vf_coef': 0.6476297730714322, 'gae_lambda': 0.8560826380436493, 'max_grad_norm': 0.28220896098020737, 'n_steps': 256, 'batch_size': 64, 'n_envs': 2, 'clip_range': 3.635168304559443}.
  374. == Status ==
  375. Memory usage on this node: 3.9/22.1 GiB
  376. Using FIFO scheduling algorithm.
  377. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  378. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  379. Number of trials: 21/100 (1 RUNNING, 20 TERMINATED)
  380.  
  381.  
  382. Trial train_076c547e completed. Last result: mean_reward=-543.0008129155872
  383. == Status ==
  384. Memory usage on this node: 3.6/22.1 GiB
  385. Using FIFO scheduling algorithm.
  386. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  387. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  388. Number of trials: 22/100 (1 RUNNING, 21 TERMINATED)
  389.  
  390.  
  391. Trial train_10e7911a reported mean_reward=-128.84589711904772 with parameters={'n_epochs': 50, 'gamma': 0.9978464043043339, 'ent_coef': 0.022563924452572225, 'learning_rate': 0.0004469832104888018, 'vf_coef': 0.8654210205194833, 'gae_lambda': 0.8351901314622519, 'max_grad_norm': 1.0880141723291787, 'n_steps': 1024, 'batch_size': 32, 'n_envs': 2, 'clip_range': 2.7321952736365525}.
  392. == Status ==
  393. Memory usage on this node: 3.8/22.1 GiB
  394. Using FIFO scheduling algorithm.
  395. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  396. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  397. Number of trials: 22/100 (1 RUNNING, 21 TERMINATED)
  398.  
  399.  
  400. Trial train_10e7911a completed. Last result: mean_reward=-128.84589711904772
  401. == Status ==
  402. Memory usage on this node: 3.1/22.1 GiB
  403. Using FIFO scheduling algorithm.
  404. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  405. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  406. Number of trials: 23/100 (1 RUNNING, 22 TERMINATED)
  407.  
  408.  
  409. Trial train_6ea6e96e reported mean_reward=-134.40476417170285 with parameters={'n_epochs': 44, 'gamma': 0.9961428465056605, 'ent_coef': 0.029913447495994137, 'learning_rate': 0.00047275816020251986, 'vf_coef': 0.8775199156779477, 'gae_lambda': 0.8378232392682137, 'max_grad_norm': 1.4147699319653861, 'n_steps': 1024, 'batch_size': 32, 'n_envs': 2, 'clip_range': 2.623266794529213}.
  410. == Status ==
  411. Memory usage on this node: 3.9/22.1 GiB
  412. Using FIFO scheduling algorithm.
  413. Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
  414. Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
  415. Number of trials: 23/100 (1 RUNNING, 22 TERMINATED)
  416.  
  417.  
RAW Paste Data