1 dataset tagged "iteration"
Policy loss, value loss, game length, and MCTS agreement metrics across 62 iterations of AlphaZero-style reinforcement learning.