Advertisement

Untitled

a guest

May 23rd, 2019

98

0

Never

Add comment

Not a member of Pastebin yet? Sign Up, it unlocks many cool features!

text 0.23 KB | None | 0 0

raw download clone embed print report

policy_values = []
eps = np.linspace(0, 2/3, num = 10)
for e in eps:
policy = make_epsilon_greedy_policy(e)
policy_values.append( evaluate_policy_return(T, behavioral_policy, policy) )
plt.plot(eps, policy_values)
plt.show()

Advertisement

Add Comment

Please, Sign In to add comment

Advertisement

Public Pastes

Untitled
C++ | 12 min ago | 2.26 KB
Untitled
C++ | 13 min ago | 2.06 KB
Untitled
C++ | 13 min ago | 2.26 KB
Untitled
C++ | 14 min ago | 1.98 KB
Untitled
C++ | 14 min ago | 2.21 KB
Untitled
C++ | 15 min ago | 1.59 KB
Untitled
C++ | 16 min ago | 1.27 KB
Incident table
TypeScript | 17 min ago | 3.61 KB

Advertisement