Japanese
AobaZero is a user-participated Shogi AI project that will test the AlphaZero Shogi experiment.


If you are interested, please join us. Anyone can contribute using Google Colab.

GitHub Source, executable files. GitHub(Japanese top page)

2020-07-08 Drop the learning rate to 0.00002. (from 10980k games, w1450).
2020-06-28 Colab generates games 13 times faster(Tesla T4) by using develop branch.
2019-11-08 Drop the learning rate to 0.0002. (from 4340k games, w787).
2019-10-29 Drop the learning rate to 0.02. (from 4220k games, w775).
2019-07-09 v1.4 Update required. Random seed for visit count sampling was constant.


2020-07-12 12:03 JST(update every 30 minutes)
In past hour,number of clients are 8, 3334 games.
In past 24 hours, number of clients are 20, 79873 games.
Total 11309780 games. Latest weight= w1481. Next is in 0.4 hours. Thank you for your contribution!
In past 1000 games, Average of moves 139.4, Sente winrate 0.560, Draw rate 0.064
In past 500,000 games, Average of moves 136.2, Sente winrate 0.549, Draw rate 0.056

Elo progress. It is based on a self-match with the previous weight. Left vertical axis is Elo. Right is floodgate and vs Kristallweizen 1k,10k,50k,100k,200k. Horizontal axis is the weight for every 10,000 games.
As of 2020-07-10.

AobaZero 800playouts/move vs Kristallweizen 200k/move. 800 match games.


You can see the process of acquiring Shogi knowledge from the game records.
Self-play games without noise. Each game uses same weight.

You can see the transition of opening moves.


Self-play games for training.
Self-play games for every 10,000 games added. The top of the page is the latest game. It will be updated every other day.

These are self-play games for training. It often plays blunder for the first 30 moves.
And sometimes it choose a move that is not a best by adding noise on root node.


Game records
From arch000000000000.csa.xz to arch000009260000.csa.xz.
These will be updated each two weeks.
From
no000000000000.csa to
no000000121031.csa
 are generated by not using neural network, but random function.
The first game that is generated by neural network is
no000000121032.csa
no000001017999.csa. Up to here, 64x15block, window is past 100,000 games.
no000001018000.csa. From here, 256x20block, window is past 500,000 games.
Weights
From w000000000001.txt.xz to w000000001279.txt.xz.
Network size is 64 x 15 block up to w448, 256 x 20 block from w449.
w001  ...  64x15b,minibatch   64, learning rate 0.01,   wd 0.00005, momentum 0.9,   120000 games
w156  ...  64x15b,minibatch   64, learning rate 0.001,  wd 0.00005, momentum 0.9,   430000 games
w449  ... 256x20b,minibatch   64, learning rate 0.01,   wd 0.0002,  momentum 0.9,  1018000 games
w465  ... 256x20b,minibatch   64, learning rate 0.001,  wd 0.0002,  momentum 0.9,  1180000 games
w775  ... 256x20b,minibatch 4096, learning rate 0.02,   wd 0.0002,  momentum 0.9,  4220000 games
w787  ... 256x20b,minibatch  128, learning rate 0.0002, wd 0.0002,  momentum 0.9,  4340000 games
w1450 ... 256x20b,minibatch  128, learning rate 0.00002,wd 0.0002,  momentum 0.9, 10980000 games
Weights are updated each  2000 games ( 4000 iterations) up to w448.
Weights are updated each 10000 games (20000 iterations) from  w449.