92 skills found · Page 1 of 4
leela-zero / Leela ZeroGo engine with no human-provided knowledge, modeled after the AlphaGo Zero paper.
suragnair / Alpha Zero GeneralA clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
junxiaosong / AlphaZero GomokuAn implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
pytorch / ELFELF: a platform for game research with AlphaGoZero/AlphaZero reimplementation
Tencent / PhoenixGoGo AI program which implements the AlphaGo Zero paper
Zeta36 / Chess Alpha ZeroChess reinforcement learning by AlphaGo Zero methods.
NeymarL / ChineseChess AlphaZeroImplement AlphaZero/AlphaGo Zero methods on Chinese chess.
maxpumperla / Deep Learning And The Game Of GoCode and other material for the book "Deep Learning and the Game of Go"
mokemokechicken / Reversi Alpha ZeroReversi reinforcement learning by AlphaGo Zero methods.
chengstone / Cchess ZeroAlphaZero implemented Chinese chess. AlphaGo Zero / AlphaZero实践项目,实现中国象棋。
bupticybee / IcyChessZero中国象棋alpha zero程序
yhyu13 / AlphaGOZero Python TensorflowCongratulation to DeepMind! This is a reengineering implementation (on behalf of many other git repo in /support/) of DeepMind's Oct19th publication: [Mastering the Game of Go without Human Knowledge]. The supervised learning approach is more practical for individuals. (This repository has single purpose of education only)
dylandjian / SuperGoA student implementation of Alpha Go Zero
initial-h / AlphaZero Gomoku MPIAn asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku
michaelnny / Alpha ZeroA PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games
llSourcell / Alphago DemoThis is the code for "How Does DeepMind's AlphaGo Zero Work?" Siraj Raval on Youtube
Zeta36 / Connect4 Alpha ZeroConnect4 reinforcement learning by AlphaGo Zero methods.
YoujiaZhang / AlphaGo Zero GobangAlphaGo-Zero-Gobang 是一个基于强化学习的五子棋(Gobang)模型,主要用以了解AlphaGo Zero的运行原理的Demo,即神经网络是如何指导MCTS做出决策的,以及如何自我对弈学习。源码+教程
Urinx / ReinforcementLearningReinforcing Your Learning of Reinforcement Learning
blanyal / Alpha ZeroAlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm" by DeepMind.