More actions
imported>rabierre No edit summary |
imported>rabierre No edit summary |
||
| Line 1: | Line 1: | ||
== Reinforcement Learning == | == Reinforcement Learning == | ||
=== Lecture 5: Model Free Control === | === Lecture 5: Model Free Control === | ||
동영상 주소: https://www.youtube.com/watch?v=0g4j2k_Ggc4&t=2466s | |||
* on policy vs off policy | |||
* ε-Greedy | * ε-Greedy | ||
* Sarsa | * Sarsa | ||
Revision as of 06:48, 5 August 2017
Reinforcement Learning
Lecture 5: Model Free Control
동영상 주소: https://www.youtube.com/watch?v=0g4j2k_Ggc4&t=2466s
- on policy vs off policy
- ε-Greedy
- Sarsa