updateValue = lastReward + self.alpha * (1 / lastNumberOfVisits) * (reward + self.gamma * currentReward - lastReward)
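The line above reads like a temporal-difference style update whose step size shrinks with the visit count. As a minimal sketch under that assumption, the same arithmetic can be written as a standalone function. The function name `td_update`, its parameter names, and the example numbers are hypothetical and not part of the paste. `last_value` and `current_value` stand in for `lastReward` and `currentReward`, which appear to hold value estimates rather than raw rewards.

```python
def td_update(last_value, reward, current_value, visits, alpha, gamma):
    """Return the updated estimate for the previously visited state.

    Hypothetical helper mirroring the pasted line:
    last + alpha * (1 / visits) * (reward + gamma * current - last)
    """
    if visits < 1:
        # The pasted line would divide by zero here; guarding is an assumption.
        raise ValueError("visits must be at least 1")
    step = alpha * (1 / visits)  # step size decays as the state is revisited
    td_error = reward + gamma * current_value - last_value
    return last_value + step * td_error


# Arbitrary illustrative numbers, chosen so the arithmetic is exact.
new_value = td_update(0.0, reward=1.0, current_value=2.0, visits=2,
                      alpha=0.5, gamma=0.5)
print(new_value)  # 0.0 + 0.25 * (1.0 + 0.5 * 2.0 - 0.0) = 0.5
```

Dividing by the visit count means early visits move the estimate a lot and later visits move it less. Whether `lastNumberOfVisits` can ever be zero in the original code is not visible from the paste, so the guard is only a precaution.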