[Computer-go] AlphaZero paper difference between 2017 and 2018
Hiroshi Yamashita
yss at bd.mbn.or.jp
Thu Mar 28 02:11:02 PDT 2019
Hi,
I found AlphaZero paper Table S3 is different 2017 and 2018.
2017 2018
Mini-batches 700k 700k
Training Time 34h 13d
Training Games 21 million 140 million
Thinking Time 800 sims, 200ms 800 sims, 200ms
Training Time is 34h -> 13d 9.2 times
Training Games is 21 million -> 140 million 6.6 times
Chess and Shogi is same.
And Figure 1 is also a bit different in Shogi and Go. Chess looks same.
Why these numbers are so different? Is it typo?
AlphaZero(2017/12/05)
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
https://arxiv.org/abs/1712.01815
AlphaZero(2018/12/07)
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
https://deepmind.com/documents/260/alphazero_preprint.pdf
Thanks,
Hiroshi Yamashita
More information about the Computer-go
mailing list