Torrent details for "Liu M. AlphaGo Simplified.Rule-Based AI and Deep Learning in Everyday Games 2025 [andryold1]"    Log in to bookmark

wide
Torrent details
Cover
Download
Torrent rating (0 rated)
Controls:
Category:
Language:
English English
Total Size:
30.21 MB
Info Hash:
c240c04849d7db2352873f2d7c1c119a659386cb
Added By:
Added:  
07-07-2024 09:54
Views:
133
Health:
Seeds:
53
Leechers:
8
Completed:
419
wide




Description
wide
Externally indexed torrent
If you are the original uploader, contact staff to have it moved to your account
Textbook in PDF format

May 11, 1997, was a watershed moment in the history of artificial intelligence (AI): the IBM supercomputer chess engine, Deep Blue, beat the world Chess champion, Garry Kasparov. It was the first time a machine had triumphed over a human player in a Chess tournament. Fast forward 19 years to May 9, 2016, DeepMind’s AlphaGo beat the world Go champion Lee Sedol. AI again stole the spotlight and generated a media frenzy. This time, a new type of AI algorithm, namely machine learning (ML) was the driving force behind the game strategies.
What exactly is ML? How is it related to AI? Why is deep learning (DL) so popular these days? This book explains how traditional rule-based AI and ML work and how they can be imple mented in everyday games such as Last Coin Standing, Tic Tac Toe, or Connect Four. Game rules in these three games are easy to implement. As a result, readers will learn rule-based AI, deep reinforcement learning, and more importantly, how to combine the two to create powerful game strategies (the whole is indeed greater than the sum of its parts) without getting bogged down in complicated game rules.
Preface
Section I Rule-Based AI
Rule-Based AI in the Coin Game
Look-Ahead Search in Tic Tac Toe
Planning Three Steps Ahead in Connect Four
Recursion and MiniMax Tree Search
Depth Pruning in MiniMax
Alpha-Beta Pruning
Position Evaluation in MiniMax
Monte Carlo Tree Search
Section II Deep Learning
Deep Learning in the Coin Game
Policy Networks in Tic Tac Toe
A Policy Network in Connect Four
Section III Reinforcement Learning
Tabular Q-Learning in the Coin Game
Self-Play Deep Reinforcement Learning
Vectorization to Speed Up Deep Reinforcement Learning
A Value Network in Connect Four
Section IV AlphaGo Algorithms
Implementing AlphaGo in the Coin Game
AlphaGo in Tic Tac Toe and Connect Four
Hyperparameter Tuning in AlphaGo
The Actor-Critic Method and AlphaZero
Iterative Self-Play and AlphaZero in Tic Tac Toe
AlphaZero in Unsolved Games
Bibliography

  User comments    Sort newest first

No comments have been posted yet.



Post anonymous comment
  • Comments need intelligible text (not only emojis or meaningless drivel).
  • No upload requests, visit the forum or message the uploader for this.
  • Use common sense and try to stay on topic.

  • :) :( :D :P :-) B) 8o :? 8) ;) :-* :-( :| O:-D Party Pirates Yuk Facepalm :-@ :o) Pacman Shit Alien eyes Ass Warn Help Bad Love Joystick Boom Eggplant Floppy TV Ghost Note Msg


    CAPTCHA Image 

    Anonymous comments have a moderation delay and show up after 15 minutes