Mastering the Game of Go with Deep Neural Networks and Tree Search

Mastering the Game of Go with Deep Neural Networks and Tree Search

Mastering the game of Go with deep neural networks and tree search Nature, Jan, 2016 Roadmap What this paper is about? • Deep Learning • Search problem • How to explore a huge tree (graph) AlphaGo Video https://www.youtube.com/watch?v=53YLZBSS0cc https://www.youtube.com/watch?v=g-dKXOlsf98 rank AlphaGo vs*European*Champion*(Fan*Hui 27Dan)* October$5$– 9,$2015 <Official$match> I Time+limit:+1+hour I AlphaGo Wins (5:0) AlphaGo vs*World*Champion*(Lee*Sedol 97Dan) March$9$– 15,$2016 <Official$match> I Time+limit:+2+hours Venue:+Seoul,+Four+Seasons+Hotel Image+Source: Josun+Times Jan+28th 2015 Lee*Sedol wiki Photo+source: Maeil+Economics 2013/04 Lee Sedol AlphaGo is*estimated*to*be*around*~57dan =$multiple$machines European$champion The Game Go Elo Ranking http://www.goratings.org/history/ Lee Sedol VS Ke Jie How about Other Games? Tic Tac Toe Chess Chess (1996) Deep Blue (1996) AlphaGo is the Skynet? Go Game Simple Rules High Complexity High Complexity Different Games Search Problem (the search space) Tic Tac Toe Tic Tac Toe The “Tree” in Tic Tac Toe The “Tree” of Chess The “Tree” of Go Game Search Problem (how to search) MiniMax in Tic Tac Toe Adversarial"Search"–"MiniMax"" 1" 1" 1" 0" J1" 0" 0" 0" J1" J1" J1" J1" 5" Adversarial"Search"–"MiniMax"" J1" 1" 0" J1" 1" 1" J1" 0" J1" J1" 1" 1" 1" 0" J1" 0" 0" 0" J1" J1" J1" J1" 6" What is the problem? 1. Generate the Search Tree 2. use MinMax Search The Size of the Tree Tic Tac Toe: b = 9, d =9 Chess: b = 35, d =80 Go: b = 250, d =150 b : number of legal move per position d : its depth (game length) One Grain of Rice https://www.youtube.com/watch?v=byk3pA1GPgU The “Space” of GO Game How about other Games? • Flappy bird? Tic Tac Toe: • Angry Bird? b = 9, d =9 • Starcraft? Chess: • learning a language b = 35, d =80 • Write a paper Go: Get a MS/PhD degree b = 250, d =150 • • Finding a job • Life How to solve? Chess (1996) Monte Carlo Las Vegas Monte"Carlo"Tree"Search" Tree"search" ……." ……." ……." ……." Monte"Carlo"search" ……." ……." ……." ……." ……." 7" Monte"Carlo"Tree"Search" • Tree"Search"+"Monte"Carlo"Method"" – SelecIon" – Expansion" 3/5" white"wins"/"total" – SimulaIon" 2/3" 1/2" – BackJPropagaIon" 1/1" 1/2" 1/1" 0/1" 1/1" 0/1" 8".

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    51 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us