[Computer-go] Exploration formulas for UCT

Aja ajahuang at gmail.com
Sat Jan 1 19:51:05 PST 2011


   Hi petr,

>  We use the Silver formula:
>
> rave_visits / (rave_visits + real_visits + rave_visits * real_visits * 
> 3000)
>
> The figure of 3000 is surprisingly resilient. Even with radically
> different heuristics and playouts, it stays the empirical optimum.

   Interesting. According to Sylvain's original post here, that means you 
set bias to sqrt(3000/4)=27.386... But is not bias should be in the range 
[0,1]?

  Aja
 




More information about the Computer-go mailing list