Untitled

nohup: ignoring input
[32m[I 2021-04-06 22:02:35,181][0m A new study created in memory with name: optuna[0m
2021-04-06 22:02:35,188	INFO worker.py:654 -- Connecting to existing Ray cluster at address: 10.128.0.18:6379
2021-04-06 22:02:35,241	WARNING function_runner.py:545 -- Function checkpointing is disabled. This may result in unexpected behavior when using checkpointing features or certain schedulers. To enable, set the train function arguments to be `func(config, checkpoint_dir=None)`.
== Status ==
Memory usage on this node: 1.2/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 0/6 CPUs, 0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 1/100 (1 PENDING)


Trial train_cb329ede reported mean_reward=-131.23800454200708 with parameters={'n_epochs': 28, 'gamma': 0.9208008749298612, 'ent_coef': 0.07838595977369499, 'learning_rate': 0.00024377717754597446, 'vf_coef': 0.4063916805295914, 'gae_lambda': 0.9080348648918256, 'max_grad_norm': 3.3720863271450323, 'n_steps': 128, 'batch_size': 64, 'n_envs': 4, 'clip_range': 4.4916679570111}.
== Status ==
Memory usage on this node: 3.7/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 1/100 (1 RUNNING)


Trial train_cb329ede completed. Last result: mean_reward=-131.23800454200708
== Status ==
Memory usage on this node: 3.5/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 2/100 (1 RUNNING, 1 TERMINATED)


Trial train_587a9c0e reported mean_reward=-747.1447419311 with parameters={'n_epochs': 10, 'gamma': 0.9409558869890438, 'ent_coef': 0.09354804296077848, 'learning_rate': 3.566834690355431e-05, 'vf_coef': 0.31884445314719273, 'gae_lambda': 0.9066677687619378, 'max_grad_norm': 0.44119894428950707, 'n_steps': 2048, 'batch_size': 64, 'n_envs': 8, 'clip_range': 1.5987089847642335}.
== Status ==
Memory usage on this node: 3.7/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 2/100 (1 RUNNING, 1 TERMINATED)


Trial train_587a9c0e completed. Last result: mean_reward=-747.1447419311
== Status ==
Memory usage on this node: 3.5/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 3/100 (1 RUNNING, 2 TERMINATED)


Trial train_2467d8e0 reported mean_reward=-575.1108521857764 with parameters={'n_epochs': 19, 'gamma': 0.9927378898574789, 'ent_coef': 0.0029549299244529797, 'learning_rate': 4.6406339680686644e-05, 'vf_coef': 0.9734658735035873, 'gae_lambda': 0.8029049212061022, 'max_grad_norm': 0.022285111579608642, 'n_steps': 4096, 'batch_size': 32, 'n_envs': 2, 'clip_range': 1.1308224052869966}.
== Status ==
Memory usage on this node: 3.8/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 3/100 (1 RUNNING, 2 TERMINATED)


Trial train_2467d8e0 completed. Last result: mean_reward=-575.1108521857764
== Status ==
Memory usage on this node: 3.6/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 4/100 (1 RUNNING, 3 TERMINATED)


Trial train_929317f6 reported mean_reward=-725.4147631028152 with parameters={'n_epochs': 5, 'gamma': 0.9026014570728472, 'ent_coef': 0.01185128893399963, 'learning_rate': 7.086151452253238e-05, 'vf_coef': 0.5025901486338282, 'gae_lambda': 0.9725369111603837, 'max_grad_norm': 1.9069490545616843, 'n_steps': 512, 'batch_size': 128, 'n_envs': 4, 'clip_range': 0.5710767833464744}.
== Status ==
Memory usage on this node: 3.8/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 4/100 (1 RUNNING, 3 TERMINATED)


Trial train_929317f6 completed. Last result: mean_reward=-725.4147631028152
== Status ==
Memory usage on this node: 3.6/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 5/100 (1 RUNNING, 4 TERMINATED)


Trial train_d62efdb4 reported mean_reward=-666.8855130077344 with parameters={'n_epochs': 29, 'gamma': 0.904328783503791, 'ent_coef': 0.0041722256942914315, 'learning_rate': 4.9200815567235365e-05, 'vf_coef': 0.8324089498584022, 'gae_lambda': 0.9619214943284222, 'max_grad_norm': 0.6483644166834951, 'n_steps': 4096, 'batch_size': 64, 'n_envs': 8, 'clip_range': 0.5255197235122465}.
== Status ==
Memory usage on this node: 3.8/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 5/100 (1 RUNNING, 4 TERMINATED)


Trial train_d62efdb4 completed. Last result: mean_reward=-666.8855130077344
== Status ==
Memory usage on this node: 3.5/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 6/100 (1 RUNNING, 5 TERMINATED)


Trial train_12450bd0 reported mean_reward=-515.8101825047949 with parameters={'n_epochs': 4, 'gamma': 0.9264480859953559, 'ent_coef': 0.09888414074175368, 'learning_rate': 0.00021353130691370085, 'vf_coef': 0.8190135747804081, 'gae_lambda': 0.987473853510914, 'max_grad_norm': 0.15330445760802, 'n_steps': 512, 'batch_size': 128, 'n_envs': 4, 'clip_range': 0.7822732380246952}.
== Status ==
Memory usage on this node: 3.9/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 6/100 (1 RUNNING, 5 TERMINATED)


Trial train_12450bd0 completed. Last result: mean_reward=-515.8101825047949
== Status ==
Memory usage on this node: 3.4/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 7/100 (1 RUNNING, 6 TERMINATED)


Trial train_4f8f4bac reported mean_reward=-367.40078629461095 with parameters={'n_epochs': 6, 'gamma': 0.9625505106830452, 'ent_coef': 0.08371541626299007, 'learning_rate': 2.2259750862440304e-05, 'vf_coef': 0.5919794293806709, 'gae_lambda': 0.8851230247038313, 'max_grad_norm': 0.14011078791160458, 'n_steps': 2048, 'batch_size': 256, 'n_envs': 4, 'clip_range': 1.702862127551212}.
== Status ==
Memory usage on this node: 3.8/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 7/100 (1 RUNNING, 6 TERMINATED)


Trial train_4f8f4bac completed. Last result: mean_reward=-367.40078629461095
== Status ==
Memory usage on this node: 3.4/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 8/100 (1 RUNNING, 7 TERMINATED)


Trial train_05f6eaa0 reported mean_reward=-557.8676138391057 with parameters={'n_epochs': 25, 'gamma': 0.9428404130957041, 'ent_coef': 0.006055945743195439, 'learning_rate': 4.1305210345635324e-05, 'vf_coef': 0.21865683016229348, 'gae_lambda': 0.8547134674500554, 'max_grad_norm': 2.2272110059180545, 'n_steps': 128, 'batch_size': 256, 'n_envs': 8, 'clip_range': 2.161152723526277}.
== Status ==
Memory usage on this node: 3.8/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 8/100 (1 RUNNING, 7 TERMINATED)


Trial train_05f6eaa0 completed. Last result: mean_reward=-557.8676138391057
== Status ==
Memory usage on this node: 3.4/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 9/100 (1 RUNNING, 8 TERMINATED)


Trial train_0d026a14 reported mean_reward=-353.973561682491 with parameters={'n_epochs': 30, 'gamma': 0.9853126801731658, 'ent_coef': 0.0015959034947565314, 'learning_rate': 5.725703098457397e-06, 'vf_coef': 0.14252884462824258, 'gae_lambda': 0.9594912915123677, 'max_grad_norm': 0.07635667656761054, 'n_steps': 256, 'batch_size': 32, 'n_envs': 8, 'clip_range': 2.743715763530861}.
== Status ==
Memory usage on this node: 3.8/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 9/100 (1 RUNNING, 8 TERMINATED)


Trial train_0d026a14 completed. Last result: mean_reward=-353.973561682491
== Status ==
Memory usage on this node: 3.4/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 10/100 (1 RUNNING, 9 TERMINATED)


Trial train_60cd5f5e reported mean_reward=-418.6660568737172 with parameters={'n_epochs': 26, 'gamma': 0.9170742548714172, 'ent_coef': 0.0066875363827742255, 'learning_rate': 2.0938036433995396e-05, 'vf_coef': 0.3222517430471752, 'gae_lambda': 0.925860242134432, 'max_grad_norm': 0.6038030857142938, 'n_steps': 128, 'batch_size': 64, 'n_envs': 2, 'clip_range': 4.042212950979437}.
== Status ==
Memory usage on this node: 3.8/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 10/100 (1 RUNNING, 9 TERMINATED)


Trial train_60cd5f5e completed. Last result: mean_reward=-418.6660568737172
== Status ==
Memory usage on this node: 3.4/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 11/100 (1 RUNNING, 10 TERMINATED)


Trial train_468a32d4 reported mean_reward=-136.98070570582593 with parameters={'n_epochs': 46, 'gamma': 0.9629811000737964, 'ent_coef': 0.027311997272207015, 'learning_rate': 0.0004115031634608157, 'vf_coef': 0.5405933912758265, 'gae_lambda': 0.8278340050933533, 'max_grad_norm': 8.777612093173182, 'n_steps': 1024, 'batch_size': 64, 'n_envs': 4, 'clip_range': 4.965771390598875}.
== Status ==
Memory usage on this node: 3.9/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 11/100 (1 RUNNING, 10 TERMINATED)


Trial train_468a32d4 completed. Last result: mean_reward=-136.98070570582593
== Status ==
Memory usage on this node: 3.6/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 12/100 (1 RUNNING, 11 TERMINATED)


Trial train_d28148de reported mean_reward=-523.316864668492 with parameters={'n_epochs': 45, 'gamma': 0.9663620261046, 'ent_coef': 0.026927000067760297, 'learning_rate': 0.00043626925339494287, 'vf_coef': 0.502262879731641, 'gae_lambda': 0.8029244778314745, 'max_grad_norm': 6.316077255521917, 'n_steps': 1024, 'batch_size': 64, 'n_envs': 4, 'clip_range': 4.999094414396029}.
== Status ==
Memory usage on this node: 3.9/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 12/100 (1 RUNNING, 11 TERMINATED)


Trial train_d28148de completed. Last result: mean_reward=-523.316864668492
== Status ==
Memory usage on this node: 3.3/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 13/100 (1 RUNNING, 12 TERMINATED)


Trial train_f65e2628 reported mean_reward=-129.4695349349375 with parameters={'n_epochs': 50, 'gamma': 0.9590342427515501, 'ent_coef': 0.028309263747675693, 'learning_rate': 0.00047207703999468183, 'vf_coef': 0.6388233590517254, 'gae_lambda': 0.8436750225600244, 'max_grad_norm': 9.008647671013073, 'n_steps': 1024, 'batch_size': 64, 'n_envs': 4, 'clip_range': 4.728036699543357}.
== Status ==
Memory usage on this node: 3.8/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 13/100 (1 RUNNING, 12 TERMINATED)


Trial train_f65e2628 completed. Last result: mean_reward=-129.4695349349375
== Status ==
Memory usage on this node: 3.6/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 14/100 (1 RUNNING, 13 TERMINATED)


Trial train_cc8ba158 reported mean_reward=-1306.7836980176983 with parameters={'n_epochs': 40, 'gamma': 0.9283325993414108, 'ent_coef': 0.03940899994950076, 'learning_rate': 0.0001649347924821429, 'vf_coef': 0.6876966801913331, 'gae_lambda': 0.862118702812846, 'max_grad_norm': 3.6530161845355695, 'n_steps': 1024, 'batch_size': 64, 'n_envs': 4, 'clip_range': 3.928949949033421}.
== Status ==
Memory usage on this node: 3.9/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 14/100 (1 RUNNING, 13 TERMINATED)


Trial train_cc8ba158 completed. Last result: mean_reward=-1306.7836980176983
== Status ==
Memory usage on this node: 3.6/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 15/100 (1 RUNNING, 14 TERMINATED)


Trial train_a32a66e6 reported mean_reward=-131.21861467633474 with parameters={'n_epochs': 40, 'gamma': 0.9744751082393622, 'ent_coef': 0.056704691945807725, 'learning_rate': 0.00019449118368446654, 'vf_coef': 0.3757978012627515, 'gae_lambda': 0.9272263038583353, 'max_grad_norm': 8.607097833696697, 'n_steps': 128, 'batch_size': 64, 'n_envs': 4, 'clip_range': 4.062748253305418}.
== Status ==
Memory usage on this node: 3.9/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 15/100 (1 RUNNING, 14 TERMINATED)


Trial train_a32a66e6 completed. Last result: mean_reward=-131.21861467633474
== Status ==
Memory usage on this node: 3.6/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 16/100 (1 RUNNING, 15 TERMINATED)


Trial train_9dc6abda reported mean_reward=-576.6606970102672 with parameters={'n_epochs': 38, 'gamma': 0.9778318433329929, 'ent_coef': 0.01583639524093322, 'learning_rate': 0.00010891966607336705, 'vf_coef': 0.6661647766886074, 'gae_lambda': 0.9380584763832434, 'max_grad_norm': 9.874016303630176, 'n_steps': 256, 'batch_size': 64, 'n_envs': 4, 'clip_range': 3.2519434580767843}.
== Status ==
Memory usage on this node: 3.9/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 16/100 (1 RUNNING, 15 TERMINATED)


Trial train_9dc6abda completed. Last result: mean_reward=-576.6606970102672
== Status ==
Memory usage on this node: 3.3/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 17/100 (1 RUNNING, 16 TERMINATED)


Trial train_18a1ed0e reported mean_reward=-124.94211826171923 with parameters={'n_epochs': 50, 'gamma': 0.9989847047078314, 'ent_coef': 0.04800913827756165, 'learning_rate': 0.0004902849988308048, 'vf_coef': 0.4121798462876674, 'gae_lambda': 0.8719406578825957, 'max_grad_norm': 1.0298862201407044, 'n_steps': 128, 'batch_size': 64, 'n_envs': 2, 'clip_range': 3.528424586912717}.
== Status ==
Memory usage on this node: 3.8/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 17/100 (1 RUNNING, 16 TERMINATED)


Trial train_18a1ed0e completed. Last result: mean_reward=-124.94211826171923
== Status ==
Memory usage on this node: 3.6/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 18/100 (1 RUNNING, 17 TERMINATED)


Trial train_53ac1436 reported mean_reward=-130.7688301089932 with parameters={'n_epochs': 50, 'gamma': 0.9942527657352764, 'ent_coef': 0.021232492043020787, 'learning_rate': 0.0004778031919789803, 'vf_coef': 0.7711064359329675, 'gae_lambda': 0.8384972623663133, 'max_grad_norm': 0.9535358681684816, 'n_steps': 1024, 'batch_size': 32, 'n_envs': 2, 'clip_range': 3.2537549730930304}.
== Status ==
Memory usage on this node: 3.9/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 18/100 (1 RUNNING, 17 TERMINATED)


Trial train_53ac1436 completed. Last result: mean_reward=-130.7688301089932
== Status ==
Memory usage on this node: 3.6/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 19/100 (1 RUNNING, 18 TERMINATED)


Trial train_9cb5b8d0 reported mean_reward=-668.8659370243561 with parameters={'n_epochs': 48, 'gamma': 0.9516761496092414, 'ent_coef': 0.042273852466731106, 'learning_rate': 6.653707760614514e-06, 'vf_coef': 0.9639489273850157, 'gae_lambda': 0.877809891197249, 'max_grad_norm': 0.023520227397575245, 'n_steps': 128, 'batch_size': 128, 'n_envs': 2, 'clip_range': 3.2712656819696693}.
== Status ==
Memory usage on this node: 3.9/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 19/100 (1 RUNNING, 18 TERMINATED)


Trial train_9cb5b8d0 completed. Last result: mean_reward=-668.8659370243561
== Status ==
Memory usage on this node: 3.6/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 20/100 (1 RUNNING, 19 TERMINATED)


Trial train_e0e0392c reported mean_reward=-860.4115850948813 with parameters={'n_epochs': 36, 'gamma': 0.9524583212847471, 'ent_coef': 0.04864675402341336, 'learning_rate': 0.00031397879001347653, 'vf_coef': 0.4401889443344522, 'gae_lambda': 0.8257528097265099, 'max_grad_norm': 1.242692172556195, 'n_steps': 1024, 'batch_size': 256, 'n_envs': 2, 'clip_range': 4.576516806983119}.
== Status ==
Memory usage on this node: 3.9/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 20/100 (1 RUNNING, 19 TERMINATED)


Trial train_e0e0392c completed. Last result: mean_reward=-860.4115850948813
== Status ==
Memory usage on this node: 3.6/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 21/100 (1 RUNNING, 20 TERMINATED)


Trial train_076c547e reported mean_reward=-543.0008129155872 with parameters={'n_epochs': 50, 'gamma': 0.98329575524038, 'ent_coef': 0.014719623141605522, 'learning_rate': 0.00011620204489079275, 'vf_coef': 0.6476297730714322, 'gae_lambda': 0.8560826380436493, 'max_grad_norm': 0.28220896098020737, 'n_steps': 256, 'batch_size': 64, 'n_envs': 2, 'clip_range': 3.635168304559443}.
== Status ==
Memory usage on this node: 3.9/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 21/100 (1 RUNNING, 20 TERMINATED)


Trial train_076c547e completed. Last result: mean_reward=-543.0008129155872
== Status ==
Memory usage on this node: 3.6/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 22/100 (1 RUNNING, 21 TERMINATED)


Trial train_10e7911a reported mean_reward=-128.84589711904772 with parameters={'n_epochs': 50, 'gamma': 0.9978464043043339, 'ent_coef': 0.022563924452572225, 'learning_rate': 0.0004469832104888018, 'vf_coef': 0.8654210205194833, 'gae_lambda': 0.8351901314622519, 'max_grad_norm': 1.0880141723291787, 'n_steps': 1024, 'batch_size': 32, 'n_envs': 2, 'clip_range': 2.7321952736365525}.
== Status ==
Memory usage on this node: 3.8/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 22/100 (1 RUNNING, 21 TERMINATED)


Trial train_10e7911a completed. Last result: mean_reward=-128.84589711904772
== Status ==
Memory usage on this node: 3.1/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 23/100 (1 RUNNING, 22 TERMINATED)


Trial train_6ea6e96e reported mean_reward=-134.40476417170285 with parameters={'n_epochs': 44, 'gamma': 0.9961428465056605, 'ent_coef': 0.029913447495994137, 'learning_rate': 0.00047275816020251986, 'vf_coef': 0.8775199156779477, 'gae_lambda': 0.8378232392682137, 'max_grad_norm': 1.4147699319653861, 'n_steps': 1024, 'batch_size': 32, 'n_envs': 2, 'clip_range': 2.623266794529213}.
== Status ==
Memory usage on this node: 3.9/22.1 GiB
Using FIFO scheduling algorithm.
Resources requested: 5.0/6 CPUs, 1.0/1 GPUs, 0.0/12.98 GiB heap, 0.0/6.49 GiB objects (0.0/1.0 accelerator_type:T4)
Result logdir: /home/justin_terry/ray_results/train_2021-04-06_22-02-35
Number of trials: 23/100 (1 RUNNING, 22 TERMINATED)