[Computer-go] Exploration formulas for UCT
Aja
ajahuang at gmail.com
Sat Jan 1 19:51:05 PST 2011
Hi petr,
> We use the Silver formula:
>
> rave_visits / (rave_visits + real_visits + rave_visits * real_visits *
> 3000)
>
> The figure of 3000 is surprisingly resilient. Even with radically
> different heuristics and playouts, it stays the empirical optimum.
Interesting. According to Sylvain's original post here, that means you
set bias to sqrt(3000/4)=27.386... But is not bias should be in the range
[0,1]?
Aja
More information about the Computer-go
mailing list