[Computer-go] Value network that doesn't want to learn.
Gian-Carlo Pascutto
gcp at sjeng.org
Mon Jun 19 13:48:46 PDT 2017
On 19/06/2017 21:31, Vincent Richard wrote:
> - The data is then analyzed by a script which extracts all kind of
> features from games. When I'm training a network, I load the features I
> want from this analysis to build the batch. I have 2 possible methods
> for the batch construction. I can either add moves one after the other
> (the fast mode) or pick random moves among different games (slower but
> reduces the variance).
You absolutely need the latter, especially as for outcome prediction the
moves from the same game are not independent samples.
> During sime of the tests, all the networks I was training had the same
> layers except for the last. So as you suggested, I was also wondering if
> this last layer wasn’t the problem. Yet, I haven’t found any error.
...
> However, if I feed a stupid
> value as target output (for example black always win) it has no trouble
> learning.
A problem with side to move/won side marking in the input or feature
planes, or with the expected outcome (0 vs 1 vs -1)?
--
GCP
More information about the Computer-go
mailing list