Catan AI Self-Play Training Metrics (171 Iterations)

AlphaZero-style training run for Catan: policy/value losses, game lengths, MCTS agreement, and value calibration across 171 self-play iterations.
# iteration loss_policy_train loss_value_train loss_policy_val loss_value_val gradient_steps game_length_avg game_length_stddev game_length_min game_length_max game_p1_wins game_p2_wins game_draws policy_entropy_avg policy_max_prob_avg policy_entropy_high_branch_avg policy_max_prob_high_branch_avg policy_agreement_avg value_z_avg value_q_avg value_z_stddev value_q_stddev value_correction_avg value_q_spread_avg value_error_early_avg value_error_late_avg value_network_stddev bench_wins bench_losses bench_draws lr q_weight mcts_sims replay_samples samples_iter time_selfplay_secs time_train_secs time_bench_secs
1 1 2.177857 0.124353 2.182894 0.12368 753 391.82 28.621686 261 402 62 76 12 0.61276 0.750411 0.642879 0.764643 0.312216 0.056454 -0.03952 0.59048 0.081665 0.105695 0.062848 0.54799 0.562514 0.018912 0 0 0 0.001 0.01 400 80233 80233 132.927991 40.621692 0.0
2 2 1.902223 0.091472 1.929586 0.096489 1950 398.213333 19.146309 253 402 56 81 13 0.678358 0.73538 0.967341 0.654916 0.584229 0.022963 0.058943 0.52021 0.420487 0.115382 0.059854 0.397581 0.35226 0.437281 0 0 0 0.001 0.02 404 207719 127486 206.234833 91.554848 0.0
3 3 1.791844 0.066477 1.781964 0.062388 3108 393.44 29.75802 208 402 61 74 15 0.542632 0.787347 0.74944 0.733701 0.52306 0.022825 -0.039515 0.559324 0.369987 0.135705 0.082413 0.44469 0.346395 0.403209 0 0 0 0.001 0.03 408 331267 123548 204.988928 146.1988 0.0
4 4 1.757926 0.058006 1.778482 0.052806 4194 350.506667 65.948338 180 402 52 88 10 0.60345 0.763272 0.83375 0.701288 0.51033 0.007438 0.002922 0.754719 0.443414 0.126479 0.073304 0.623355 0.531046 0.477275 0 0 0 0.001 0.04 412 447288 116021 192.218529 196.032728 0.0
5 5 1.73822 0.046324 1.773433 0.041718 5208 324.826667 80.868762 143 402 61 79 10 0.517782 0.79729 0.706892 0.746141 0.452112 0.026215 -0.039466 0.792385 0.502297 0.156855 0.090201 0.656838 0.536687 0.543355 0 0 0 0.001 0.05 416 555509 108221 178.446632 243.443996 0.0
6 6 1.723562 0.03834 1.744324 0.033786 6192 325.1 76.262245 113 402 80 66 4 0.53279 0.791909 0.756237 0.729138 0.466711 0.021808 -0.023995 0.81681 0.588114 0.161964 0.093601 0.736244 0.552058 0.632369 0 0 0 0.001 0.06 420 660469 104960 170.425003 289.131173 0.0
7 7 1.701725 0.033403 1.717879 0.036099 7035 295.6 78.039819 136 402 68 79 3 0.530392 0.792313 0.750406 0.731045 0.405151 0.01585 -0.022885 0.892591 0.61681 0.181807 0.110216 0.618008 0.525172 0.66685 0 0 0 0.001 0.07 424 750090 89621 157.841815 329.363986 0.0
8 8 1.692267 0.030236 1.707962 0.029044 7902 313.353333 80.451156 138 402 71 74 5 0.645966 0.745311 0.947782 0.658325 0.424487 0.028609 -0.006869 0.854752 0.688194 0.187782 0.106271 0.70025 0.526715 0.739344 0 0 0 0.001 0.08 428 842875 92785 158.513472 369.20086 0.0
9 9 1.675097 0.028836 1.693705 0.028374 8850 330.06 76.975947 160 402 61 80 9 0.51755 0.794526 0.75025 0.729088 0.469614 0.016469 -0.028901 0.771098 0.626623 0.197508 0.115065 0.66364 0.511526 0.682029 0 0 0 0.001 0.09 432 943794 100919 167.87355 413.524335 0.0
10 10 1.665112 0.028406 1.693577 0.031795 9633 297.633333 81.486229 151 402 56 91 3 0.552228 0.781994 0.794611 0.711062 0.38939 0.015664 0.004171 0.864636 0.578044 0.207535 0.116984 0.744521 0.529199 0.645008 0 0 0 0.001 0.1 436 1027489 83695 154.304071 449.613322 0.0
11 11 1.646423 0.028127 1.685998 0.030211 9663 287.626667 83.639827 134 402 76 69 5 0.571375 0.77349 0.809297 0.706168 0.420659 0.040658 -0.003013 0.885059 0.613561 0.210771 0.119772 0.740728 0.58017 0.678145 0 0 0 0.001 0.11 440 1030561 83305 153.096274 451.781107 0.0
12 12 1.64891 0.028125 1.681327 0.026443 9156 259.546667 76.436953 111 402 62 85 3 0.600805 0.763035 0.877221 0.680734 0.3954 0.03175 0.022479 0.937728 0.620071 0.216956 0.122459 0.816338 0.566607 0.687159 0 0 0 0.001 0.12 444 976555 73480 145.801105 428.110759 0.0
13 13 1.659477 0.029237 1.686066 0.029281 8766 277.053333 80.906018 124 402 65 81 4 0.629509 0.750777 0.89267 0.678505 0.406835 0.026879 -0.003832 0.898393 0.649197 0.195405 0.111542 0.785577 0.513682 0.709877 0 0 0 0.001 0.13 448 935023 82016 160.66426 410.00575 0.0
14 14 1.659065 0.029311 1.687179 0.027554 8463 278.88 80.705714 93 402 66 83 1 0.558252 0.778772 0.800991 0.710749 0.421479 0.042855 -0.019963 0.925287 0.626701 0.208242 0.123048 0.761978 0.574806 0.693063 0 0 0 0.001 0.14 452 902598 83596 152.796436 395.928797 0.0
15 15 1.651715 0.028745 1.699153 0.027429 8220 273.846667 78.767484 119 402 75 71 4 0.546855 0.783184 0.80513 0.709757 0.404153 0.014504 -0.039314 0.917848 0.63841 0.226237 0.129678 0.75782 0.529304 0.713523 0 0 0 0.001 0.15 456 876529 82152 156.620852 384.916445 0.0
16 16 1.63863 0.028677 1.682146 0.032831 8013 273.353333 85.202906 125 402 79 65 6 0.660473 0.740938 0.97604 0.6558 0.444775 0.027763 0.006443 0.878953 0.706826 0.185692 0.104613 0.646433 0.430996 0.760794 0 0 0 0.000999 0.16 460 854719 83150 156.167948 375.117337 0.0
17 17 1.638164 0.028545 1.689154 0.032277 7881 261.613333 73.666436 127 402 73 74 3 0.641093 0.748648 0.941923 0.661926 0.398344 0.042384 0.017714 0.937882 0.660024 0.216971 0.12242 0.811684 0.597283 0.728293 0 0 0 0.000999 0.17 464 840445 75347 143.609766 368.284822 0.0
18 18 1.629325 0.029442 1.667642 0.032771 7704 260.24 73.388072 106 402 81 68 1 0.612184 0.759373 0.888034 0.681898 0.418145 0.04637 0.011882 0.965223 0.668369 0.204363 0.116589 0.822869 0.597258 0.729912 0 0 0 0.000999 0.18 468 821620 73960 142.026246 360.452244 0.0
19 19 1.629657 0.02975 1.674164 0.030596 7497 264.986667 75.536215 119 402 94 53 3 0.642174 0.747584 0.949825 0.660774 0.406496 0.031596 -0.008726 0.917729 0.679771 0.207674 0.116379 0.735946 0.530219 0.744985 0 0 0 0.000999 0.19 472 799462 78761 164.097355 351.170964 0.0
20 20 1.621956 0.02973 1.666982 0.030287 7479 263.773333 82.787612 126 402 83 66 1 0.529097 0.793035 0.798471 0.713427 0.430797 0.038585 -0.035744 0.940727 0.634446 0.235682 0.13385 0.887128 0.590103 0.713741 5 15 0 0.000999 0.2 476 797548 81781 157.645028 350.29052 70.323937
21 21 1.615424 0.029962 1.66912 0.035078 7452 262.74 74.795938 104 402 66 82 2 0.578587 0.772986 0.850936 0.697067 0.417778 0.035451 -0.003737 0.92419 0.666441 0.214449 0.12123 0.824101 0.583499 0.732687 0 0 0 0.000999 0.21 480 794635 80392 151.519962 348.596229 0.0
22 22 1.610563 0.030043 1.657237 0.031245 7413 235.246667 71.42912 114 402 83 64 3 0.593209 0.768367 0.889448 0.680834 0.410274 0.041596 0.020359 0.966012 0.650519 0.221841 0.126922 0.856921 0.599133 0.720751 0 0 0 0.000999 0.22 484 790477 69322 153.447157 346.742354 0.0
23 23 1.601696 0.03038 1.638209 0.028958 7287 232.48 71.782609 92 402 93 55 2 0.546446 0.786669 0.816034 0.709917 0.386949 0.024405 -0.033145 0.956347 0.651602 0.231087 0.13777 0.827723 0.593883 0.72616 0 0 0 0.000999 0.23 488 777116 68655 161.697546 340.763231 0.0
24 24 1.594087 0.030507 1.648252 0.030441 7176 245.08 73.087301 109 402 73 77 0 0.587671 0.771454 0.884587 0.68652 0.393034 0.048797 -0.033028 0.957077 0.666394 0.222219 0.131039 0.82538 0.552867 0.735619 0 0 0 0.000999 0.24 492 765440 71920 152.399912 335.950243 0.0
25 25 1.59305 0.030947 1.648104 0.033068 7095 249.493333 69.892083 106 402 93 54 3 0.592466 0.768898 0.877182 0.688424 0.397009 0.027363 -0.016728 0.949702 0.669554 0.213836 0.123635 0.823397 0.516871 0.736685 0 0 0 0.000999 0.25 496 756581 73293 146.124654 333.190658 0.0
26 26 1.587472 0.030964 1.635905 0.031902 6921 230.426667 68.432677 111 402 77 73 0 0.569069 0.778745 0.840771 0.700031 0.358397 0.017939 -0.025659 0.992282 0.659718 0.213555 0.126793 0.818489 0.552041 0.722156 0 0 0 0.000999 0.26 500 738094 64663 143.481959 323.247039 0.0
27 27 1.578224 0.03067 1.635418 0.029875 6936 248.5 74.729891 116 402 70 79 1 0.626867 0.756123 0.916041 0.677749 0.412743 0.042321 -0.000995 0.963463 0.687066 0.180408 0.10544 0.729547 0.490691 0.73823 0 0 0 0.000998 0.27 504 739589 76842 154.627389 325.439352 0.0
28 28 1.57372 0.031413 1.63087 0.031917 6897 230.593333 67.78339 93 402 69 79 2 0.524596 0.795776 0.771164 0.726547 0.386382 0.02484 -0.011718 0.972239 0.631042 0.218694 0.127754 0.880288 0.563812 0.698448 0 0 0 0.000998 0.28 508 735376 69747 158.093002 322.912704 0.0
29 29 1.567569 0.031121 1.635012 0.032751 6816 238.366667 71.386219 102 402 83 66 1 0.580231 0.776023 0.870991 0.69433 0.383891 0.020395 -0.020487 0.955869 0.671013 0.199377 0.116749 0.828539 0.501843 0.727814 0 0 0 0.000998 0.29 512 727023 70408 152.035249 319.083833 0.0
30 30 1.570939 0.031684 1.624119 0.032922 6663 234.486667 65.223384 124 402 79 69 2 0.625441 0.757914 0.929837 0.670856 0.366303 0.039829 0.014345 0.974348 0.669091 0.206998 0.120057 0.830495 0.501232 0.730588 0 0 0 0.000998 0.3 516 710434 65192 143.237232 311.937701 0.0
31 31 1.566704 0.032022 1.624758 0.031253 6537 228.293333 69.660562 107 402 67 83 0 0.540945 0.790562 0.791185 0.720977 0.372057 0.030218 -0.010836 0.977065 0.625018 0.199681 0.11641 0.846948 0.610617 0.682621 0 0 0 0.000998 0.31 520 697072 67030 143.884323 306.054482 0.0
32 32 1.56147 0.032651 1.619309 0.034104 6606 251.42 71.775973 127 402 78 72 0 0.532251 0.792788 0.777483 0.724596 0.395102 0.014569 -0.023057 0.955358 0.629825 0.206405 0.120189 0.820394 0.56808 0.691144 0 0 0 0.000998 0.32 524 704353 76603 162.613905 310.39337 0.0
33 33 1.567229 0.032542 1.617486 0.034484 6633 242.533333 75.32407 120 402 72 78 0 0.590177 0.770619 0.868569 0.692376 0.397274 0.043553 0.010237 0.955893 0.634313 0.194086 0.109601 0.813757 0.600268 0.691735 0 0 0 0.000998 0.33 528 707301 71603 164.728006 312.173662 0.0
34 34 1.560991 0.032652 1.614261 0.032424 6606 219.433333 65.408401 107 402 90 60 0 0.534714 0.792152 0.765159 0.730709 0.399479 0.014754 -0.032554 0.97484 0.632762 0.196549 0.114687 0.866354 0.59685 0.683481 0 0 0 0.000998 0.34 532 704516 69135 167.60455 309.253722 0.0
35 35 1.558181 0.032277 1.636611 0.034991 6561 230.406667 72.929381 106 402 74 75 1 0.561953 0.781957 0.820342 0.71103 0.374823 0.034984 -0.019448 0.98091 0.648781 0.193036 0.111907 0.803687 0.536948 0.698974 0 0 0 0.000997 0.35 536 699626 68403 153.812481 307.578315 0.0
36 36 1.558286 0.032042 1.617023 0.032411 6594 226.16 66.215213 97 402 86 63 1 0.59112 0.772012 0.858764 0.699205 0.399008 0.029936 -0.009698 0.973595 0.64437 0.183551 0.105577 0.750106 0.536973 0.693861 0 0 0 0.000997 0.36 540 703325 68362 150.663214 308.813549 0.0
37 37 1.559312 0.033397 1.617 0.032549 6423 214.846667 62.215511 109 402 79 70 1 0.58052 0.776317 0.8452 0.70137 0.359487 0.035577 0.012701 0.987165 0.619705 0.193674 0.111035 0.895644 0.58045 0.671031 0 0 0 0.000997 0.37 544 684919 58436 131.986347 301.543617 0.0
38 38 1.558566 0.033686 1.621288 0.036082 6390 223.726667 63.07402 112 402 71 78 1 0.518799 0.799169 0.753616 0.732456 0.386646 0.017054 -0.014122 0.980326 0.605653 0.18807 0.109872 0.922996 0.634522 0.654013 0 0 0 0.000997 0.38 548 681491 66319 158.740409 298.77985 0.0
39 39 1.55684 0.033339 1.628934 0.035311 6363 226.926667 65.029849 109 402 83 66 1 0.510535 0.803212 0.73773 0.740378 0.365296 0.026106 -0.020866 0.980479 0.610263 0.190622 0.111634 0.839489 0.584141 0.655357 0 0 0 0.000997 0.39 552 678442 67359 161.806134 297.924591 0.0
40 40 1.554609 0.034597 1.616441 0.034891 6312 213.166667 57.784937 100 402 74 76 0 0.511438 0.802002 0.736125 0.739036 0.340401 0.031723 -0.010342 0.99383 0.594143 0.197588 0.115057 0.886636 0.613755 0.643691 4 16 0 0.000997 0.4 556 673144 59894 128.44055 295.713714 108.94367
41 41 1.552239 0.035246 1.626626 0.036039 6270 218.786667 65.360496 100 402 65 83 2 0.549224 0.789625 0.796688 0.719362 0.349428 0.021146 -0.006815 0.979837 0.607945 0.177222 0.104605 0.860389 0.562566 0.649568 0 0 0 0.000996 0.41 560 668536 62422 145.502664 294.384166 0.0
42 42 1.549737 0.035433 1.631427 0.038659 6108 210.56 59.212102 105 402 88 62 0 0.514015 0.802018 0.739924 0.73759 0.375133 0.02804 -0.008127 0.983531 0.586405 0.186208 0.108792 0.837036 0.573954 0.635127 0 0 0 0.000996 0.42 564 651312 59379 141.837917 286.273724 0.0
43 43 1.547777 0.0356 1.617633 0.036411 5985 210.626667 55.836493 87 402 73 76 1 0.525893 0.797243 0.772012 0.72773 0.363168 0.033037 -0.011672 0.990287 0.621017 0.187124 0.110293 0.832489 0.577083 0.665 0 0 0 0.000996 0.43 568 638098 58389 135.866092 280.916251 0.0
44 44 1.54754 0.03636 1.612705 0.035422 5904 216.9 66.816739 95 402 74 74 2 0.540119 0.792066 0.77279 0.726889 0.365326 0.017509 0.010429 0.973632 0.592328 0.185584 0.108457 0.891784 0.619282 0.64144 0 0 0 0.000996 0.44 572 629561 60598 143.43193 276.385936 0.0
45 45 1.544196 0.03638 1.615983 0.03633 5820 212.686667 60.849008 102 402 73 77 0 0.542294 0.792359 0.777828 0.726391 0.346915 0.050365 -0.005834 0.993878 0.601538 0.17158 0.101021 0.889434 0.631726 0.642335 0 0 0 0.000996 0.45 576 620723 59565 146.070702 272.569344 0.0
46 46 1.540623 0.037376 1.613946 0.037558 5718 210.12 62.404478 81 402 71 79 0 0.542125 0.792925 0.79023 0.722749 0.367956 0.044029 -0.008696 0.979136 0.608203 0.171207 0.100273 0.904836 0.593879 0.645643 0 0 0 0.000996 0.46 580 609914 57553 148.543038 268.404939 0.0
47 47 1.536305 0.03613 1.624177 0.037055 5739 212.413333 65.222203 91 402 89 60 1 0.5393 0.792448 0.766621 0.730846 0.369594 0.048676 0.006767 0.979521 0.601863 0.160611 0.093815 0.826444 0.539768 0.639102 0 0 0 0.000995 0.47 584 611847 60369 153.636133 269.630994 0.0
48 48 1.540088 0.03724 1.617354 0.038775 5670 212.693333 59.431748 102 402 83 67 0 0.564287 0.784145 0.805751 0.716357 0.354726 0.031954 0.021182 0.976983 0.606385 0.15972 0.093204 0.866229 0.645238 0.642776 0 0 0 0.000995 0.48 588 604675 59147 141.940928 265.981365 0.0
49 49 1.540653 0.038126 1.611965 0.039577 5577 209.346667 62.829822 93 402 80 70 0 0.544129 0.792538 0.789947 0.722905 0.350968 0.038368 0.004552 0.978255 0.598763 0.162156 0.094525 0.842521 0.619762 0.630705 0 0 0 0.000995 0.49 592 594760 57444 153.518449 260.685346 0.0
50 50 1.539367 0.03823 1.613155 0.038229 5577 211.893333 62.985728 86 402 75 73 2 0.537998 0.793995 0.778208 0.726861 0.363793 0.043982 -0.001452 0.979405 0.601201 0.163406 0.093523 0.860666 0.582315 0.638133 0 0 0 0.000995 0.5 596 594606 59740 145.505922 261.806925 0.0
51 51 1.537162 0.037599 1.625579 0.038308 5514 204.446667 63.985367 93 402 82 67 1 0.543181 0.792592 0.769955 0.727859 0.364482 0.039641 0.02147 0.986275 0.587948 0.153374 0.089428 0.837199 0.561991 0.621087 0 0 0 0.000994 0.51 600 587896 55712 136.812165 259.056852 0.0
52 52 1.534987 0.03725 1.623439 0.037992 5520 211.593333 63.592567 97 402 73 76 1 0.519912 0.800057 0.747446 0.735317 0.381707 0.045467 -0.000933 0.980061 0.587642 0.157543 0.090398 0.833884 0.55099 0.621701 0 0 0 0.000994 0.52 604 588736 60219 158.718149 258.855555 0.0
53 53 1.532366 0.03674 1.62096 0.036283 5526 204.753333 66.756267 91 402 83 67 0 0.524481 0.79887 0.7379 0.740233 0.369439 0.035199 0.007516 0.976518 0.58338 0.150607 0.087938 0.798092 0.584842 0.615441 0 0 0 0.000994 0.53 608 589369 59022 147.55106 259.712024 0.0
54 54 1.533095 0.037244 1.609196 0.036826 5493 202.62 62.730819 105 402 77 73 0 0.503773 0.807697 0.725582 0.74324 0.375825 0.031421 0.01122 0.986239 0.567938 0.157992 0.091773 0.893239 0.636013 0.602851 0 0 0 0.000994 0.54 612 585707 56936 141.865579 258.267036 0.0
55 55 1.528917 0.037562 1.629478 0.038965 5445 193.36 58.978333 86 402 80 70 0 0.502011 0.808501 0.707257 0.749382 0.350166 0.023689 -0.00335 0.990828 0.565597 0.15582 0.091074 0.853301 0.572431 0.597501 0 0 0 0.000994 0.55 616 580639 54497 167.222014 255.214215 0.0
56 56 1.524806 0.037195 1.601233 0.038104 5424 189.613333 56.880434 88 402 83 66 1 0.48504 0.814832 0.682248 0.75881 0.357712 0.017433 -0.000191 0.991767 0.556566 0.15841 0.091839 0.853038 0.613983 0.589744 0 0 0 0.000993 0.56 620 578326 55240 171.782811 254.976438 0.0
57 57 1.522011 0.038064 1.591999 0.038703 5322 183.673333 47.795885 96 369 83 67 0 0.543674 0.793504 0.766238 0.729255 0.345275 0.031849 0.026379 0.999493 0.563399 0.145125 0.084297 0.892368 0.603388 0.590268 0 0 0 0.000993 0.57 624 567503 49546 112.032018 250.256521 0.0
58 58 1.514364 0.037788 1.607315 0.038048 5295 195.926667 55.807538 85 402 77 72 1 0.475004 0.818697 0.678328 0.758749 0.379211 0.023075 0.01003 0.989766 0.534531 0.149387 0.086407 0.895192 0.626664 0.566467 0 0 0 0.000993 0.58 628 564781 56425 151.29712 249.140621 0.0
59 59 1.50509 0.037655 1.599532 0.038077 5289 194.466667 59.078103 94 402 62 87 1 0.495511 0.810681 0.698428 0.752249 0.391566 0.060725 0.014176 0.988593 0.559565 0.138837 0.080241 0.877019 0.592959 0.587169 0 0 0 0.000993 0.59 632 564035 56698 145.2654 248.58842 0.0
60 60 1.504368 0.037308 1.597592 0.037039 5250 188.226667 61.123061 88 402 65 83 2 0.52118 0.801091 0.73364 0.741149 0.372824 0.042843 0.01769 0.973396 0.578647 0.141732 0.081932 0.803885 0.558896 0.605724 6 14 0 0.000992 0.6 636 559777 55482 150.291003 246.3868 103.87174
61 61 1.498907 0.037369 1.588539 0.038619 5235 189.44 54.856599 92 402 87 63 0 0.529099 0.798404 0.751478 0.734957 0.373804 0.054068 0.02642 0.998537 0.569659 0.128836 0.075596 0.905084 0.600533 0.592148 0 0 0 0.000992 0.61 640 558219 54154 163.572653 246.772307 0.0
62 62 1.49634 0.037353 1.587958 0.038842 5112 172.106667 44.695137 80 305 82 68 0 0.527259 0.799799 0.739108 0.738854 0.377909 0.04405 0.042491 0.999029 0.53076 0.135945 0.079238 0.888119 0.613253 0.557289 0 0 0 0.000992 0.62 644 544969 46969 134.543029 240.782982 0.0
63 63 1.488345 0.037782 1.586897 0.03919 5028 181.0 48.880262 73 373 66 84 0 0.51243 0.805038 0.725202 0.743096 0.371756 0.053548 0.031965 0.998565 0.544339 0.134452 0.077825 0.887744 0.590267 0.570283 0 0 0 0.000991 0.63 648 536238 50291 130.965067 236.029003 0.0
64 64 1.486837 0.037217 1.568208 0.037153 4971 178.08 53.509192 90 392 66 84 0 0.509512 0.806475 0.707163 0.751163 0.374492 0.044267 0.027994 0.99902 0.537742 0.124719 0.074065 0.887032 0.593152 0.557995 0 0 0 0.000991 0.64 652 530040 50738 148.16565 233.320156 0.0
65 65 1.473768 0.036746 1.578961 0.037707 4902 168.54 48.72626 93 395 81 69 0 0.52594 0.800262 0.734652 0.740228 0.363648 0.040008 0.040568 0.999199 0.542233 0.125586 0.074637 0.885907 0.567868 0.56311 0 0 0 0.000991 0.65 656 522858 47315 139.815925 229.940846 0.0
66 66 1.47112 0.03589 1.568183 0.035737 4833 168.993333 51.715955 86 402 86 64 0 0.539593 0.795159 0.746546 0.737035 0.39123 0.043622 0.032068 0.99236 0.557272 0.116241 0.068917 0.857758 0.585853 0.576401 0 0 0 0.000991 0.66 660 515255 47637 156.474439 226.371251 0.0
67 67 1.466088 0.035271 1.568818 0.035527 4785 167.133333 40.283275 88 315 80 70 0 0.56444 0.789242 0.798985 0.720557 0.370048 0.063224 0.038055 0.997999 0.564929 0.115837 0.069258 0.864581 0.568048 0.582251 0 0 0 0.00099 0.67 664 510360 44651 121.307568 224.246669 0.0
68 68 1.463579 0.034354 1.56131 0.034149 4689 169.073333 42.810138 87 283 79 71 0 0.580849 0.781474 0.82241 0.71136 0.384685 0.058464 0.050317 0.99829 0.566069 0.111701 0.0663 0.848528 0.547513 0.58195 0 0 0 0.00099 0.68 668 500100 46165 119.426738 219.778047 0.0
69 69 1.458075 0.033469 1.566126 0.03361 4608 174.493333 58.881774 69 402 76 73 1 0.599109 0.775036 0.847795 0.703812 0.37899 0.049849 0.051254 0.989509 0.581519 0.10398 0.061217 0.816207 0.519762 0.595847 0 0 0 0.00099 0.69 672 491427 48025 137.791537 215.037868 0.0
70 70 1.488383 0.030107 1.66096 0.028215 441 176.206667 53.578017 89 426 73 77 0 0.587756 0.777823 0.823747 0.712017 0.385926 0.064324 0.071087 0.997929 0.567788 0.109384 0.062809 0.833387 0.521266 0.583615 0 0 0 0.000989 0.7 676 46965 46965 141.018056 26.119628 0.0
71 71 1.474764 0.029598 1.562822 0.029367 873 168.286667 52.299565 62 448 67 82 1 0.572688 0.78355 0.810838 0.714731 0.384995 0.056129 0.073172 0.985537 0.556593 0.108297 0.062715 0.816718 0.536859 0.568715 0 0 0 0.000989 0.71 680 93002 46037 151.801074 40.54773 0.0
72 72 1.432108 0.029155 1.574874 0.029011 1326 173.42 54.370613 80 452 71 78 1 0.586985 0.778045 0.838058 0.707036 0.397097 0.075769 0.082942 0.985174 0.569617 0.100708 0.058536 0.880311 0.545579 0.582698 0 0 0 0.000989 0.72 684 141373 48371 148.486988 61.64352 0.0
73 73 1.407748 0.027925 1.546813 0.028242 1782 168.146667 52.385289 74 464 76 74 0 0.590531 0.777745 0.854537 0.700989 0.417487 0.07491 0.086466 0.987427 0.57452 0.100792 0.058487 0.836107 0.522204 0.585172 0 0 0 0.000989 0.73 688 189911 48538 157.282014 83.522198 0.0
74 74 1.398901 0.028182 1.522222 0.028448 2229 173.853333 55.221178 78 436 86 64 0 0.578097 0.781829 0.832407 0.706428 0.422923 0.075879 0.069247 0.997117 0.56213 0.104214 0.059414 0.883376 0.55997 0.572471 0 0 0 0.000988 0.74 692 237513 47602 144.291212 104.036748 0.0
75 75 1.383446 0.026344 1.4958 0.027053 2664 164.853333 55.647987 81 456 83 67 0 0.62374 0.766084 0.883235 0.691797 0.4148 0.107323 0.102512 0.994224 0.572736 0.091317 0.053702 0.790814 0.499435 0.580813 0 0 0 0.000988 0.75 696 284027 46514 150.679239 123.707568 0.0
76 76 1.375443 0.02622 1.479962 0.026508 3123 170.78 48.470179 76 339 87 63 0 0.581123 0.78086 0.834549 0.706487 0.439487 0.085911 0.095956 0.996303 0.553966 0.099326 0.056411 0.856153 0.573873 0.563426 0 0 0 0.000988 0.76 700 333066 49039 122.656637 145.431453 0.0
77 77 1.369653 0.025641 1.474705 0.02582 3630 178.2 60.093261 66 476 76 74 0 0.553695 0.790323 0.782933 0.725327 0.442964 0.066558 0.065767 0.989264 0.546507 0.093234 0.054808 0.853867 0.603877 0.558101 0 0 0 0.000987 0.77 704 386944 53878 175.590192 168.820594 0.0
78 78 1.368864 0.024585 1.441731 0.024692 4038 161.04 56.212203 80 472 83 66 1 0.583576 0.780849 0.834097 0.707139 0.420789 0.091572 0.071148 0.983962 0.531982 0.093482 0.055345 0.837482 0.510609 0.540732 0 0 0 0.000987 0.78 708 430429 43485 155.799792 187.757302 0.0
79 79 1.357307 0.024225 1.443368 0.024196 4500 168.286667 56.621119 86 452 74 76 0 0.571506 0.785042 0.815409 0.714648 0.440893 0.08234 0.080652 0.996604 0.552127 0.087945 0.052261 0.894362 0.555014 0.562609 0 0 0 0.000987 0.79 712 479992 49563 168.032371 208.854678 0.0
80 80 1.359172 0.023147 1.44499 0.024131 4506 160.053333 47.070413 81 355 76 74 0 0.58725 0.778856 0.849013 0.702936 0.437388 0.103092 0.088055 0.994672 0.552507 0.08446 0.05115 0.814653 0.502386 0.561455 7 13 0 0.000986 0.8 716 480470 47443 134.078857 209.283562 82.809687
81 81 1.354259 0.022155 1.436613 0.022287 4530 174.673333 51.995576 92 424 67 83 0 0.631728 0.763051 0.917348 0.680861 0.428205 0.093489 0.10759 0.99562 0.56514 0.084042 0.048784 0.871195 0.554412 0.571364 0 0 0 0.000986 0.81 720 483134 48701 138.416003 210.733598 0.0
82 82 1.34267 0.021077 1.435097 0.021093 4536 165.986667 46.923766 82 464 65 85 0 0.582321 0.781573 0.850323 0.70237 0.459051 0.073722 0.077459 0.987998 0.550454 0.079541 0.04823 0.827227 0.525388 0.558549 0 0 0 0.000985 0.82 724 483690 48927 164.499786 210.80226 0.0
83 83 1.334916 0.020014 1.412305 0.019933 4530 159.08 44.15692 78 360 74 76 0 0.592366 0.778888 0.885761 0.692055 0.472596 0.091 0.091072 0.995851 0.558672 0.075757 0.045791 0.832606 0.552127 0.566264 0 0 0 0.000985 0.83 728 483064 47912 143.325245 210.018932 0.0
84 84 1.320801 0.018759 1.413769 0.018689 4494 149.786667 33.074983 90 247 72 78 0 0.618032 0.770175 0.912275 0.682472 0.452627 0.09137 0.097591 0.995817 0.542537 0.072505 0.044609 0.858677 0.512491 0.546259 0 0 0 0.000985 0.84 732 479295 43833 116.608586 208.768892 0.0
85 85 1.306507 0.017477 1.406928 0.017756 4503 155.86 57.245091 65 482 73 76 1 0.609041 0.772902 0.886897 0.691826 0.460083 0.09792 0.092539 0.972051 0.562883 0.068826 0.042974 0.841526 0.472798 0.568574 0 0 0 0.000984 0.85 736 480279 47498 177.288298 209.246887 0.0
86 86 1.299897 0.016266 1.380982 0.016787 4470 151.773333 50.991914 58 464 78 71 1 0.610763 0.771808 0.889032 0.690558 0.462581 0.112346 0.098687 0.978461 0.551931 0.069122 0.042898 0.862449 0.518519 0.557848 0 0 0 0.000984 0.86 740 476484 45244 167.075851 208.374959 0.0
87 87 1.291683 0.014878 1.387179 0.014777 4410 158.8 50.685172 78 373 76 74 0 0.616007 0.769281 0.914072 0.681406 0.47769 0.072042 0.090502 0.997402 0.538995 0.070346 0.041693 0.884083 0.534341 0.543023 0 0 0 0.000984 0.87 744 470231 47625 136.23597 205.68246 0.0
88 88 1.276206 0.01361 1.361058 0.013747 4416 149.48 39.520749 67 294 84 66 0 0.625509 0.766629 0.919159 0.680609 0.466916 0.102841 0.107428 0.994698 0.553664 0.065421 0.041321 0.872865 0.512311 0.557365 0 0 0 0.000983 0.88 748 470921 44175 128.959367 205.579953 0.0
89 89 1.26582 0.012652 1.334122 0.012688 4410 159.946667 46.317856 81 321 70 80 0 0.619737 0.768898 0.922372 0.679422 0.477497 0.084948 0.100492 0.996385 0.553345 0.063166 0.039841 0.863314 0.566525 0.557169 0 0 0 0.000983 0.89 752 470129 48771 133.551086 204.811722 0.0
90 90 1.245353 0.011599 1.34329 0.011351 4419 155.16 48.16722 79 410 78 72 0 0.599586 0.776025 0.909645 0.684688 0.502662 0.106282 0.087272 0.994336 0.55615 0.060802 0.039166 0.83112 0.517747 0.558748 0 0 0 0.000982 0.9 756 471142 48456 154.780728 205.069333 0.0
91 91 1.231075 0.010503 1.319003 0.010858 4422 152.653333 51.759828 75 480 91 59 0 0.617663 0.768489 0.922022 0.679949 0.508298 0.117142 0.092242 0.993115 0.560899 0.057715 0.036939 0.821535 0.500802 0.561459 0 0 0 0.000982 0.91 760 471672 49231 174.222512 205.558115 0.0
92 92 1.220551 0.009481 1.298685 0.009435 4419 153.613333 42.003141 75 298 78 72 0 0.586753 0.781249 0.877059 0.694938 0.492815 0.097038 0.087271 0.995281 0.548365 0.058005 0.03928 0.823424 0.519597 0.552682 0 0 0 0.000982 0.92 764 471252 48507 139.452272 205.552256 0.0
93 93 1.203439 0.008653 1.294121 0.008609 4446 156.473333 50.067514 72 466 75 74 1 0.578358 0.783337 0.873327 0.695097 0.521095 0.078653 0.091806 0.985001 0.539902 0.055778 0.036628 0.849151 0.567887 0.54279 0 0 0 0.000981 0.93 768 474133 50793 179.233008 206.607231 0.0
94 94 1.192465 0.007883 1.290785 0.008123 4506 152.653333 43.848829 70 358 76 74 0 0.581937 0.78158 0.878423 0.693436 0.52026 0.076251 0.076771 0.997089 0.546291 0.054886 0.036704 0.870103 0.583979 0.547934 0 0 0 0.000981 0.94 772 480398 50098 155.006472 208.905319 0.0
95 95 1.170152 0.007046 1.272861 0.00708 4545 155.066667 45.30352 85 352 84 66 0 0.633573 0.761347 0.959413 0.666676 0.532404 0.116085 0.111891 0.993239 0.572005 0.047132 0.030527 0.81868 0.498181 0.572067 0 0 0 0.00098 0.95 776 484776 51876 152.402599 211.507746 0.0
96 96 1.157608 0.006311 1.236152 0.006333 4620 160.433333 57.149094 65 478 71 78 1 0.612605 0.770494 0.926705 0.679357 0.513683 0.108658 0.089895 0.982432 0.561347 0.047023 0.032643 0.800877 0.4736 0.563805 0 0 0 0.00098 0.96 780 492699 53167 182.748425 214.754438 0.0
97 97 1.144429 0.005792 1.221046 0.005765 4686 159.44 55.636257 80 480 76 73 1 0.585215 0.779131 0.886584 0.690412 0.520871 0.0875 0.093524 0.983911 0.536814 0.050243 0.033825 0.850425 0.565187 0.539672 0 0 0 0.00098 0.97 784 499600 54526 196.708317 218.272121 0.0
98 98 1.124896 0.005363 1.208942 0.005333 4746 149.726667 42.459376 69 291 83 67 0 0.580313 0.78326 0.902797 0.685193 0.534717 0.110218 0.082313 0.993907 0.54732 0.046064 0.033812 0.826081 0.517889 0.548661 0 0 0 0.000979 0.98 788 505934 50509 150.848311 220.493069 0.0
99 99 1.108953 0.004885 1.196395 0.005095 4782 155.546667 49.294569 72 386 72 78 0 0.618549 0.76843 0.953668 0.668294 0.536758 0.111242 0.099377 0.993793 0.551232 0.04281 0.030261 0.845394 0.502447 0.5519 0 0 0 0.000979 0.99 792 509859 52696 181.599408 221.607086 0.0
100 100 1.095481 0.004618 1.18602 0.004606 4848 160.813333 49.177961 67 382 70 80 0 0.568135 0.785688 0.8628 0.698332 0.537481 0.120326 0.089063 0.992734 0.53773 0.043714 0.032968 0.852534 0.503032 0.541013 19 1 0 0.000978 1.0 796 517085 55682 163.979324 225.41637 87.039994
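
The rows above are whitespace-separated, with one value per column in header order. A minimal Python sketch of how they might be parsed follows; the names `COLUMNS`, `parse_row`, `bench_win_rate`, and `row_index` (standing in for the `#` column) are illustrative and not part of the dataset, and the page does not say what the benchmark opponent is. The two sample rows are copied verbatim from iterations 20 and 100.

```python
# Column names in table order; "row_index" is an assumed name for the "#" column.
COLUMNS = (
    "row_index iteration loss_policy_train loss_value_train loss_policy_val "
    "loss_value_val gradient_steps game_length_avg game_length_stddev "
    "game_length_min game_length_max game_p1_wins game_p2_wins game_draws "
    "policy_entropy_avg policy_max_prob_avg policy_entropy_high_branch_avg "
    "policy_max_prob_high_branch_avg policy_agreement_avg value_z_avg "
    "value_q_avg value_z_stddev value_q_stddev value_correction_avg "
    "value_q_spread_avg value_error_early_avg value_error_late_avg "
    "value_network_stddev bench_wins bench_losses bench_draws lr q_weight "
    "mcts_sims replay_samples samples_iter time_selfplay_secs "
    "time_train_secs time_bench_secs"
).split()

# Two rows copied verbatim from the table (iterations 20 and 100).
ROWS = [
    "20 20 1.621956 0.02973 1.666982 0.030287 7479 263.773333 82.787612 126 402 83 66 1 0.529097 0.793035 0.798471 0.713427 0.430797 0.038585 -0.035744 0.940727 0.634446 0.235682 0.13385 0.887128 0.590103 0.713741 5 15 0 0.000999 0.2 476 797548 81781 157.645028 350.29052 70.323937",
    "100 100 1.095481 0.004618 1.18602 0.004606 4848 160.813333 49.177961 67 382 70 80 0 0.568135 0.785688 0.8628 0.698332 0.537481 0.120326 0.089063 0.992734 0.53773 0.043714 0.032968 0.852534 0.503032 0.541013 19 1 0 0.000978 1.0 796 517085 55682 163.979324 225.41637 87.039994",
]


def parse_row(line):
    """Split one whitespace-separated row into a {column: number} dict."""
    tokens = line.split()
    if len(tokens) != len(COLUMNS):
        raise ValueError(f"expected {len(COLUMNS)} values, got {len(tokens)}")
    record = {}
    for name, token in zip(COLUMNS, tokens):
        # Tokens without a decimal point are treated as counts, the rest as floats.
        record[name] = float(token) if "." in token else int(token)
    return record


def bench_win_rate(record):
    """Fraction of benchmark games won, or None when no benchmark was run."""
    games = record["bench_wins"] + record["bench_losses"] + record["bench_draws"]
    return record["bench_wins"] / games if games else None


records = [parse_row(line) for line in ROWS]
for r in records:
    print(r["iteration"], r["game_draws"], bench_win_rate(r))
```

Benchmark games appear only every 20th iteration in the rows shown, so `bench_win_rate` returns `None` for the others rather than dividing by zero.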

Catan AI Self-Play Training Metrics (171 Iterations) — AI Analysis

Policy loss dropped 64% overall; the decline slowed between iterations 30 and 69, then sped up again after iteration 70, so the rows shown do not support a lasting plateau

Game length collapsed from 392 to 157 moves, suggesting the agent learned to close out games decisively

Training Summary

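  • Policy loss dropped 64% (2.18 → 0.79) over 171 iterations. In the rows shown, the decline slowed between iterations 30 and 69 (1.57 → 1.46) and sped up after iteration 70, when replay_samples fell from 491,427 to 46,965; iterations 1-30 account for less than half of the total drop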
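  • Value loss fell about 98% (0.124 → 0.002). It dropped to 0.028 within the first 10 iterations, drifted up to about 0.038 around iteration 50, then fell steadily to 0.0046 by iteration 100. Early-position value error worsened over the same run, so low loss alone does not show accurate game-outcome prediction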
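  • Average game length fell 60% (392 → 157 moves) as the agent learned to play decisively. Draws became rare rather than vanishing: 12 of 150 games in iteration 1, at most 2 per iteration after iteration 40 in the rows shown, and none in most iterations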
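  • Policy agreement with MCTS peaked around 60%, so even at its best the network and the tree search disagree in a large share of positions. In the rows shown, agreement reaches 0.58 at iteration 2, falls to 0.34 by iteration 40, and recovers to 0.54 by iteration 100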
  • Value error in early positions worsened (0.55 → 0.84) while late-game error stayed flat, suggesting the agent still struggles with opening evaluation
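
The percentages in the summary can be checked with a few lines of arithmetic. The sketch below is illustrative only: `pct_drop` and `CLAIMS` are names chosen here, the start and end values are the rounded figures quoted in the bullets, and the iteration 1 and iteration 30 policy losses are the `loss_policy_train` values from the table.

```python
def pct_drop(start, end):
    """Percentage decrease from start to end."""
    return 100.0 * (start - end) / start


# (start, end) pairs as quoted in the summary bullets.
CLAIMS = {
    "policy loss": (2.18, 0.79),
    "value loss": (0.124, 0.002),
    "avg game length": (392, 157),
}

for name, (start, end) in CLAIMS.items():
    print(f"{name}: {pct_drop(start, end):.1f}% drop")

# Share of the total policy-loss decline that happened by iteration 30.
# 2.177857 and 1.570939 are loss_policy_train at iterations 1 and 30;
# 0.79 is the final value quoted in the summary (rounded).
loss_iter_1, loss_iter_30, loss_final = 2.177857, 1.570939, 0.79
early_share = (loss_iter_1 - loss_iter_30) / (loss_iter_1 - loss_final)
print(f"share of policy-loss drop by iteration 30: {100 * early_share:.1f}%")
```

Because the end values are rounded in the summary, the computed percentages are approximate; the value-loss figure in particular is sensitive to the third decimal place of the final loss.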

Visualizations

Draws per Batch
Value Prediction Error: Early vs Late Game
Average Game Length
Value Loss Over Training

This dataset contains 171 records across 38 fields: iteration, loss_policy_train, loss_value_train, loss_policy_val, loss_value_val, gradient_steps, and 32 more.

171 rows · 38 columns · 2026-03-24
