103 datasets

1 datasets tagged "103"

AlphaZero-Style MCTS Training Metrics (103 Iterations)

Policy loss, value error, game length, and MCTS agreement metrics across 103 self-play training iterations.