Deep Reinforcement Learning at Scale

Deep Reinforcement Learning at Scale

Deep Reinforcement Learning at Scale Timothy Lillicrap Research Scientist, DeepMind & UCL Deep Learning at Supercomputer Scale | NIPS Workshop What is Reinforcement Learning? Supervised Learning Reinforcement Learning Fixed dataset Data depends on actions taken in environment Formalizing the Agent-Environment Loop Actions Agent Environment Neural Network(s) Rewards Observations Advantage Actor-Critic (A3C) Mnih et al., ICML 2016 A Single Trial (with Advantage Actor-Critic) Time Mnih et al., ICML 2016 Combating Variance: Advantage Actor-Critic Mnih et al., ICML 2016 Scaling Reinforcement Learning (A3C) Parameters Parameter Server Actor / Learner Gradients Mnih et al., ICML 2016 Scaling Reinforcement Learning Experience Learner(s) Replay Buffer Updated Priorities Parameters Experience + Initial Priorities Actors Horgan et al., 2017 & Schaul et al. 2015 Off-policy Actor-Critic for Continuous Actions Lillicrap et al., ICLR 2016 Distributional Distributed DDPG (D4PG) Hoffman, Barth-Maron et al., 2017 Hoffman, Barth-Maron et al., 2017 Hoffman, Barth-Maron et al., 2017 Hoffman, Barth-Maron et al., 2017 Hoffman, Barth-Maron et al., 2017 Hoffman, Barth-Maron et al., 2017 Hoffman, Barth-Maron et al., 2017 Distributional Distributed DDPG (D4PG) Hoffman, Barth-Maron et al., 2017 Hoffman, Barth-Maron et al., 2017 Hoffman, Barth-Maron et al., 2017 Playing Go with Deep Networks and Planning Use environment model in order to plan! Silver, Huang et al., Nature, 2016 Training Policy and Value Networks Silver, Huang et al., Nature, 2016 Planning with an Environment Model & MCTS Silver, Huang et al., Nature, 2016 Planning with an Environment Model Silver, Huang et al., Nature, 2016 Playing Go with Without Human Knowledge Silver, Schrittwieser, Simonyan, et al. Nature, 2017 Playing Go with Without Human Knowledge z Silver, Schrittwieser, Simonyan, et al. Nature, 2017 Playing Go with Without Human Knowledge Silver, Schrittwieser, Simonyan, et al. Nature, 2017 Playing Go with Without Human Knowledge Silver, Schrittwieser, Simonyan, et al. Nature, 2017 Playing Go with Without Human Knowledge Silver, Schrittwieser, Simonyan, et al. Nature, 2017 Playing Go with Without Human Knowledge Silver, Schrittwieser, Simonyan, et al. Nature, 2017 Playing Go with Without Human Knowledge Silver, Schrittwieser, Simonyan, et al. Nature, 2017 Questions?.

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    31 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us