Untitled

a guest
Dec 30th, 2020
behaviors:
  MonsterAgent:
    trainer_type: ppo
    hyperparameters:
      batch_size: 64
      buffer_size: 12000
      learning_rate: 0.0003
      beta: 0.001  # entropy regularization strength
      epsilon: 0.2  # PPO clipping range
      lambd: 0.99  # GAE lambda
      num_epoch: 3
      learning_rate_schedule: linear
    network_settings:
      normalize: true
      hidden_units: 128
      num_layers: 2
      vis_encode_type: simple
    reward_signals:
      extrinsic:
        gamma: 0.99
        strength: 1.0
      curiosity:
        strength: 0.02
        gamma: 0.99
        encoding_size: 256
        learning_rate: 3.0e-4
    keep_checkpoints: 5
    max_steps: 50000000
    time_horizon: 1000
    summary_freq: 12000
    threaded: true
    self_play:
      window: 10  # number of past snapshots kept as possible opponents
      play_against_latest_model_ratio: 0.5
      save_steps: 50000  # trainer steps between snapshots
      swap_steps: 2000  # ghost steps between opponent swaps
      team_change: 100000  # trainer steps between learning-team switches
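
A few of the numbers in the config above interact, and a quick arithmetic check can make that visible. The Python sketch below is not part of the paste and does not call ML-Agents. It copies the values by hand and works under stated assumptions: only full mini-batches are used per pass over the buffer, the `linear` schedule is idealized as a straight line from `learning_rate` down to 0 at `max_steps`, and the game is symmetric (one agent per team). The helper names are hypothetical.

```python
# Values copied by hand from the pasted config.
BATCH_SIZE = 64
BUFFER_SIZE = 12000
NUM_EPOCH = 3
LEARNING_RATE = 0.0003
MAX_STEPS = 50_000_000
SAVE_STEPS = 50_000
WINDOW = 10
SWAP_STEPS = 2000
TEAM_CHANGE = 100_000


def full_minibatches(buffer_size: int, batch_size: int) -> int:
    """Full mini-batches in one pass over the buffer.

    Assumption: a partial trailing mini-batch is dropped.
    """
    return buffer_size // batch_size


def linear_lr(step: int, lr0: float = LEARNING_RATE, max_steps: int = MAX_STEPS) -> float:
    """Idealized linear decay: lr0 at step 0, 0 at max_steps (assumption)."""
    progress = min(max(step / max_steps, 0.0), 1.0)
    return lr0 * (1.0 - progress)


def swaps_per_team_change(
    team_change: int,
    swap_steps: int,
    num_agents: int = 1,
    num_opponent_agents: int = 1,
) -> float:
    """Opponent swaps during one team-change period.

    Rearranged from swap_steps = (num_agents / num_opponent_agents) * (team_change / x).
    The default team sizes of 1 are an assumption, not taken from the paste.
    """
    return (num_agents / num_opponent_agents) * team_change / swap_steps


if __name__ == "__main__":
    per_epoch = full_minibatches(BUFFER_SIZE, BATCH_SIZE)
    print("full mini-batches per epoch:", per_epoch)
    print("gradient steps per buffer:", per_epoch * NUM_EPOCH)
    print("learning rate halfway through training:", linear_lr(MAX_STEPS // 2))
    print("trainer steps spanned by the snapshot window:", SAVE_STEPS * WINDOW)
    print("opponent swaps per team change:", swaps_per_team_change(TEAM_CHANGE, SWAP_STEPS))
```

Note that 12000 is not an exact multiple of 64, so under the full-mini-batch assumption part of each buffer goes unused in every pass.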