catan datasets
2 datasets tagged "catan"
Catan AI Self-Play Training Metrics (171 Iterations)
AlphaZero-style training run for Catan: policy/value losses, game lengths, MCTS agreement, and value calibration across 171 self-play iterations.
Catan RL Training — Implementation
171 training iterations of a Catan RL implementation. Tracks policy and value loss convergence, game length evolution, and self-play performance metri...