network datasets

1 dataset tagged "network"

AlphaZero Training Run: Policy and Value Network Convergence

212 iterations of AlphaZero-style self-play training, tracking policy and value loss, MCTS agreement, game outcomes, and value calibration.