Advertisement
Guest User

Untitled

a guest
May 23rd, 2019
98
0
Never
Not a member of Pastebin yet? Sign Up, it unlocks many cool features!
text 0.23 KB | None | 0 0
  1. policy_values = []
  2. eps = np.linspace(0, 2/3, num = 10)
  3. for e in eps:
  4. policy = make_epsilon_greedy_policy(e)
  5. policy_values.append( evaluate_policy_return(T, behavioral_policy, policy) )
  6.  
  7. plt.plot(eps, policy_values)
  8. plt.show()
Advertisement
Add Comment
Please, Sign In to add comment
Advertisement