BigSnarf blog

Infosec FTW

Reinforcement Learning asynchronous gradient descent

http://arxiv.org/pdf/1602.01783v1.pdf

https://www.cs.toronto.edu/~vmnih/docs/dqn.pdf

Guest Post (Part I): Demystifying Deep Reinforcement Learning

https://github.com/coreylynch/async-rl

https://classroom.udacity.com/courses/ud600/lessons/4100878601/concepts/6512308640923

http://rll.berkeley.edu/deeprlcourse/#lecture-videos

http://www0.cs.ucl.ac.uk/staff/D.Silver/web/Teaching.html

https://www.cs.ox.ac.uk/people/nando.defreitas/machinelearning/

http://cs.stanford.edu/people/karpathy/reinforcejs/

http://www.seas.upenn.edu/~cis519/fall2015/lectures/14_ReinforcementLearning.pdf
