Revision as of 13:45, 5 April 2016
Deep Learning
- "Deep Q Learning": http://sanghyukchun.github.io/90/
Links
- http://www.arcadelearningenvironment.org/
- https://www-s.acm.illinois.edu/sigart/docs/QLearning.pdf
- http://www.jmlr.org/papers/volume5/evendar03a/evendar03a.pdf
- https://www.cs.cmu.edu/afs/cs/project/jair/pub/volume4/kaelbling96a-html/node25.html
- http://www.cse.unsw.edu.au/~cs9417ml/RL1/algorithms.html
- https://webdocs.cs.ualberta.ca/~sutton/book/ebook/the-book.html
- http://www.gatsby.ucl.ac.uk/~dayan/papers/cjch.pdf
- https://github.com/soumith/cvpr2015/blob/master/DQN%20Training%20iTorch.ipynb
- https://sites.google.com/a/deepmind.com/dqn/home/Human_Level_Control_through_Deep_Reinforcement_Learning.zip?attredirects=0&d=1
- https://github.com/deepmind/xitari