AlphaZero-Style Training Metrics (177 Iterations)

iteration loss_policy_train loss_value_train loss_policy_val loss_value_val gradient_steps game_length_avg game_length_stddev game_length_min game_length_max game_wins game_losses game_draws policy_entropy_avg policy_max_prob_avg policy_entropy_high_branch_avg policy_max_prob_high_branch_avg policy_agreement_avg policy_agreement_high_branch_avg value_z_avg value_q_avg value_z_stddev value_q_stddev value_correction_avg value_correction_high_branch_avg value_q_spread_avg value_q_spread_high_branch_avg value_error_early_avg value_error_late_avg value_network_stddev bench_wins bench_losses bench_draws lr q_weight mcts_sims replay_samples samples_iter time_selfplay_secs time_train_secs time_bench_secs
1 1 2.407992 0.20431 2.360713 0.181675 186 345.903333 76.716452 117 412 140 147 13 1.314963 0.485351 1.94498 0.307379 0.176177 0.147149 0.047306 0.763073 0.066449 0.064318 0.044324 0.701056 0.715754 0.03159 0 0 0 0.0005 0.01 400 78512 78512 121.499536 67.672544 0.0
2 2 1.82506 0.166375 1.863938 0.16579 618 351.956667 94.780807 102 484 122 168 10 0.790212 0.692195 1.087475 0.635492 0.458425 0.072638 0.07329 0.794482 0.65003 0.077136 0.038399 0.55893 0.37995 0.661505 0 0 0 0.0005 0.02 426 262479 183967 270.289012 183.823189 0.0
3 3 1.644731 0.172948 1.715597 0.206918 999 277.036667 102.316805 80 482 150 144 6 0.67015 0.741367 0.921592 0.687925 0.467737 0.110777 0.085895 0.902588 0.706156 0.116371 0.068154 0.627242 0.393009 0.72974 0 0 0 0.0005 0.03 452 425004 162525 252.1125 296.036711 0.0
4 4 1.582308 0.170746 1.646063 0.17154 1191 212.936667 72.154829 84 426 151 147 2 0.750256 0.713533 1.142597 0.595025 0.311209 0.067831 0.167423 0.969897 0.612455 0.219683 0.144537 0.839175 0.566981 0.617548 0 0 0 0.0005 0.04 478 506994 81990 139.749808 352.94781 0.0
5 5 1.539906 0.15205 1.632733 0.159638 1419 215.606667 76.454029 81 438 157 139 4 0.702097 0.735178 1.055904 0.632314 0.380737 0.086472 0.145343 0.968282 0.670312 0.150867 0.087086 0.799638 0.512332 0.678613 0 0 0 0.0005 0.05 504 604581 97587 175.54236 420.692839 0.0
6 6 1.507297 0.131911 1.551103 0.121757 1620 198.126667 64.505224 70 434 155 145 0 0.751824 0.71669 1.138837 0.60608 0.361705 0.084483 0.148725 0.978515 0.716511 0.154507 0.095547 0.722266 0.422712 0.739829 0 0 0 0.0005 0.06 530 691030 86449 156.954994 479.886765 0.0
7 7 1.480466 0.11357 1.541228 0.124165 1857 208.216667 70.628628 93 458 134 165 1 0.701775 0.736917 1.074006 0.633138 0.404079 0.104553 0.114373 0.96429 0.7404 0.157397 0.096227 0.729988 0.404477 0.775139 0 0 0 0.0005 0.07 556 791582 100552 203.685917 550.002022 0.0
8 8 1.462151 0.094938 1.540773 0.091184 2112 213.253333 73.210126 85 482 148 151 1 0.616802 0.767704 0.967106 0.66954 0.437802 0.086453 0.05579 0.966329 0.75334 0.172627 0.101938 0.722655 0.36329 0.80257 0 0 0 0.0005 0.08 582 900364 108782 230.743162 626.196597 0.0
9 9 1.463118 0.170173 1.524992 0.162274 57 193.966667 64.059859 96 444 133 166 1 0.667878 0.750754 1.040098 0.647249 0.435774 0.077558 0.076283 0.983551 0.779841 0.163994 0.099255 0.670616 0.363966 0.819887 0 0 0 0.0005 0.09 608 96534 96534 207.611614 76.730998 0.0
10 10 1.44892 0.132393 1.505293 0.123528 117 181.763333 66.689784 84 462 144 155 1 0.592014 0.776632 0.941136 0.679547 0.452339 0.064768 0.058417 0.976216 0.769504 0.162214 0.102351 0.735053 0.424305 0.814853 0 0 0 0.0005 0.1 634 197224 100690 232.482844 135.387638 0.0
11 11 1.417094 0.11607 1.488111 0.11986 171 176.816667 51.234914 79 428 141 158 1 0.640494 0.759457 1.026294 0.649403 0.44316 0.086389 0.073781 0.992172 0.778276 0.171837 0.109503 0.745071 0.379566 0.827926 0 0 0 0.0005 0.11 660 288972 91748 199.062656 198.965829 0.0
12 12 1.40421 0.102969 1.482327 0.130042 222 167.816667 51.082055 87 454 144 155 1 0.622973 0.766591 0.965944 0.670829 0.438075 0.085642 0.062645 0.986126 0.770032 0.176492 0.112823 0.756706 0.408543 0.819057 0 0 0 0.0005 0.12 686 378023 89051 204.364297 259.485727 0.0
13 13 1.388012 0.109542 1.481974 0.12043 270 173.986667 51.378334 84 450 157 142 1 0.687521 0.743335 1.052557 0.637123 0.400263 0.108933 0.134833 0.982891 0.7455 0.204704 0.126514 0.724844 0.430122 0.786497 0 0 0 0.0005 0.13 712 455722 77699 201.320067 313.796478 0.0
14 14 1.364462 0.092952 1.454948 0.097873 315 168.61 52.259014 83 434 141 157 2 0.663837 0.751615 1.002019 0.654931 0.408552 0.091132 0.148626 0.984811 0.734644 0.202724 0.126143 0.781276 0.459143 0.776628 0 0 0 0.0005 0.14 738 534838 79116 191.157177 366.878952 0.0
15 15 1.337836 0.092199 1.437554 0.096812 360 169.423333 53.968918 78 450 157 143 0 0.656401 0.752929 0.983647 0.660661 0.41383 0.08956 0.110519 0.991501 0.756486 0.187203 0.114871 0.785756 0.436912 0.801701 0 0 0 0.0005 0.15 764 614349 79511 214.01048 421.371255 0.0
16 16 1.310459 0.091747 1.423533 0.087746 411 173.656667 54.519955 83 468 168 131 1 0.638624 0.759601 0.981285 0.662887 0.430236 0.084393 0.106311 0.983002 0.758783 0.185784 0.112076 0.749265 0.46745 0.80595 0 0 0 0.0005 0.16 790 700274 85925 232.327356 479.923254 0.0
17 17 1.286999 0.086561 1.41554 0.101455 459 164.363333 42.653855 75 294 149 151 0 0.588605 0.779407 0.924123 0.681955 0.43477 0.096031 0.085621 0.995378 0.744908 0.1925 0.119905 0.796478 0.425159 0.80036 0 0 0 0.0005 0.17 816 781310 81036 205.129218 536.337998 0.0
18 18 1.272646 0.081012 1.404538 0.085419 510 168.093333 52.973685 80 448 134 165 1 0.573125 0.785469 0.879483 0.700039 0.436868 0.083669 0.061315 0.986707 0.751189 0.189503 0.117598 0.77472 0.422619 0.808348 0 0 0 0.0005 0.18 842 870250 88940 249.319245 596.277074 0.0
19 19 1.25391 0.079185 1.387003 0.081274 558 158.44 51.38949 63 456 155 145 0 0.58085 0.78248 0.895219 0.692719 0.44574 0.086375 0.0712 0.98796 0.750711 0.187893 0.116968 0.759502 0.394116 0.805469 0 0 0 0.0005 0.19 868 951784 81534 233.226101 651.836553 0.0
20 20 1.635647 0.345896 1.593549 0.250605 26 158.518 47.653139 81 450 254 246 0 0.6005 0.775025 0.909287 0.688188 0.429732 0.092849 0.081633 0.992904 0.747818 0.181044 0.11286 0.788588 0.418301 0.797192 0 0 0 0.0005 0.2 894 132592 132592 377.541322 42.050971 0.0
21 21 1.55624 0.272088 1.534002 0.226235 52 155.618 44.457171 72 448 241 257 2 0.577885 0.784217 0.866142 0.70039 0.416983 0.085716 0.085451 0.986349 0.668385 0.133429 0.081516 0.811471 0.434772 0.694419 0 0 0 0.0005 0.21 920 262818 130226 375.706143 60.397679 0.0
22 22 1.516689 0.23125 1.513051 0.194863 79 156.068 46.935992 70 458 269 231 0 0.583886 0.781094 0.844942 0.711643 0.434256 0.10547 0.090314 0.989173 0.703908 0.124203 0.076655 0.75462 0.391275 0.725105 0 0 0 0.0005 0.22 946 403355 140537 392.274013 92.770921 0.0
23 23 1.492431 0.203033 1.497127 0.175039 108 155.426 59.069794 67 472 257 243 0 0.535536 0.799164 0.812534 0.72207 0.458156 0.088717 0.070105 0.974474 0.700028 0.140025 0.084903 0.742061 0.417717 0.73293 0 0 0 0.0005 0.23 972 548524 145169 465.111248 125.810833 0.0
24 24 1.479463 0.183633 1.484884 0.158837 133 149.096 48.087408 72 476 260 239 1 0.56872 0.787651 0.862148 0.704561 0.444825 0.100412 0.075662 0.984681 0.713049 0.141701 0.086066 0.765747 0.402119 0.745755 0 0 0 0.000499 0.24 998 680650 132126 433.39505 156.424696 0.0
25 25 1.467135 0.166041 1.468577 0.150338 158 143.742 42.414472 65 440 268 230 2 0.600219 0.77575 0.877914 0.699172 0.440465 0.115806 0.11227 0.982272 0.705627 0.133342 0.081914 0.754629 0.38087 0.728834 0 0 0 0.000499 0.25 1024 806533 125883 397.250461 185.217505 0.0
26 26 1.455291 0.160098 1.464196 0.146918 182 146.654 38.670328 71 329 259 241 0 0.598359 0.776127 0.895271 0.691749 0.436944 0.08744 0.110627 0.99617 0.708739 0.14312 0.087798 0.824014 0.450345 0.732909 0 0 0 0.000499 0.26 1050 927965 121432 366.948029 213.299359 0.0
27 27 1.449591 0.149152 1.449721 0.140612 207 137.96 37.518134 65 330 246 254 0 0.523905 0.804521 0.79572 0.727769 0.466838 0.080063 0.063046 0.99679 0.701501 0.139584 0.088055 0.795345 0.396188 0.733688 0 0 0 0.000499 0.27 1076 1057676 129711 410.027446 242.73307 0.0
28 28 1.438782 0.141813 1.45783 0.129802 231 144.854 38.936316 75 332 251 249 0 0.5758 0.785609 0.876261 0.698476 0.444752 0.086718 0.087272 0.996233 0.703593 0.148153 0.090451 0.791524 0.395437 0.730862 0 0 0 0.000499 0.28 1102 1178793 121117 378.036744 270.558211 0.0
29 29 1.433784 0.133265 1.442865 0.121089 257 145.056 39.886048 75 375 250 250 0 0.584125 0.779132 0.84563 0.710208 0.451232 0.121855 0.10753 0.992548 0.730062 0.12869 0.081277 0.73941 0.358599 0.759378 0 0 0 0.000499 0.29 1128 1312764 133971 448.844671 301.309427 0.0
30 30 1.424715 0.120485 1.426066 0.109924 281 142.248 38.539622 72 456 243 257 0 0.597834 0.777374 0.901512 0.691854 0.457732 0.118416 0.107999 0.989923 0.725657 0.13122 0.080508 0.707534 0.327265 0.750566 0 0 0 0.000499 0.3 1154 1438250 125486 450.508371 330.021582 0.0
31 31 1.416119 0.115314 1.424774 0.106615 305 142.062 40.389481 70 460 253 247 0 0.580568 0.78403 0.873009 0.70064 0.463206 0.109231 0.112256 0.987491 0.711374 0.135169 0.082323 0.765994 0.37116 0.73785 0 0 0 0.000499 0.31 1180 1561109 122859 437.292205 357.972519 0.0
32 32 1.40741 0.109786 1.420908 0.102778 330 141.234 34.877088 70 440 279 221 0 0.598987 0.77555 0.886448 0.697542 0.463733 0.127788 0.125232 0.989487 0.730108 0.129943 0.080022 0.709594 0.3578 0.754691 0 0 0 0.000499 0.32 1206 1686058 124949 432.305346 386.141428 0.0
33 33 1.402115 0.105741 1.41019 0.10433 353 135.132 33.641144 72 309 248 252 0 0.588448 0.779436 0.862214 0.704234 0.456191 0.130589 0.115615 0.991437 0.722655 0.12489 0.07923 0.762296 0.383171 0.747584 0 0 0 0.000499 0.33 1232 1807071 121013 425.701731 413.916634 0.0
34 34 1.395507 0.101224 1.407022 0.096257 377 138.784 37.037729 74 440 260 239 1 0.6225 0.768376 0.92883 0.681511 0.452697 0.123334 0.135222 0.987301 0.72426 0.129967 0.079477 0.731552 0.360514 0.74436 0 0 0 0.000499 0.34 1258 1925870 118799 461.969428 441.605961 0.0
35 35 1.389048 0.097046 1.404308 0.091622 399 136.612 35.35717 73 298 276 224 0 0.590707 0.779545 0.87511 0.699566 0.451224 0.125762 0.13098 0.99206 0.708763 0.133565 0.081337 0.757648 0.382605 0.732219 0 0 0 0.000499 0.35 1284 2040157 114287 424.15786 467.508738 0.0
36 36 1.382245 0.093552 1.389348 0.087264 421 130.542 30.995229 70 287 259 241 0 0.586051 0.783001 0.875933 0.700615 0.469231 0.140842 0.14755 0.990032 0.716348 0.121767 0.077096 0.740219 0.362134 0.740623 0 0 0 0.000499 0.36 1310 2154704 114547 433.191501 493.381372 0.0
37 37 1.374056 0.090479 1.383406 0.085172 445 135.946 36.490753 74 408 249 250 1 0.605865 0.773457 0.890263 0.695265 0.45975 0.139051 0.146275 0.986173 0.717547 0.123241 0.077042 0.722175 0.356686 0.738777 0 0 0 0.000499 0.37 1336 2274293 119589 495.962428 519.828066 0.0
38 38 1.369268 0.087104 1.385413 0.085036 468 135.168 32.300771 64 269 276 224 0 0.613469 0.77052 0.895294 0.69344 0.463305 0.153456 0.143658 0.988155 0.73219 0.118592 0.073765 0.710683 0.352435 0.753074 0 0 0 0.000499 0.38 1362 2392144 117851 449.776731 547.348073 0.0
39 39 1.365429 0.084522 1.384179 0.080782 491 135.916 35.186659 69 289 250 250 0 0.600579 0.773522 0.86318 0.70383 0.457518 0.137555 0.140415 0.990494 0.721086 0.123793 0.07558 0.73139 0.370693 0.742502 0 0 0 0.000499 0.39 1388 2511711 119567 493.956468 573.96753 0.0
40 40 1.359174 0.082675 1.419101 0.081544 516 136.146 37.638181 61 468 251 249 0 0.570385 0.787091 0.851999 0.708933 0.481598 0.120965 0.119263 0.989179 0.732243 0.113031 0.07145 0.731038 0.349329 0.753185 0 0 0 0.000498 0.4 1414 2637380 125669 589.497536 602.62573 0.0
41 41 1.355472 0.080755 1.366543 0.077799 540 137.104 44.487315 72 472 266 231 3 0.567181 0.787249 0.831141 0.71472 0.461986 0.102199 0.117119 0.979497 0.707141 0.122474 0.075177 0.738513 0.384665 0.728616 0 0 0 0.000498 0.41 1440 2761295 123915 557.427605 630.898201 0.0
42 42 1.350257 0.079082 1.370358 0.077349 563 130.656 34.155785 69 446 252 248 0 0.583279 0.783145 0.865072 0.703964 0.474576 0.122687 0.126228 0.989221 0.722216 0.110764 0.069598 0.73178 0.349927 0.739821 0 0 0 0.000498 0.42 1466 2877738 116443 498.832649 658.036186 0.0
43 43 1.345581 0.076946 1.361307 0.0741 585 126.99 30.040271 66 356 271 229 0 0.561987 0.789689 0.850113 0.707985 0.468453 0.123663 0.1095 0.992324 0.719087 0.11322 0.071324 0.723373 0.346336 0.740545 0 0 0 0.000498 0.43 1492 2992979 115241 508.026992 683.502118 0.0
44 44 1.340379 0.074895 1.351114 0.072543 607 126.574 32.168937 56 329 269 231 0 0.561237 0.790812 0.827994 0.716544 0.469222 0.125492 0.118993 0.992095 0.709313 0.113853 0.072183 0.738683 0.346141 0.727956 0 0 0 0.000498 0.44 1518 3106341 113362 470.40778 709.555652 0.0
45 45 1.337022 0.073621 1.348928 0.080853 630 127.096 35.480005 58 424 256 243 1 0.538079 0.798386 0.799783 0.7244 0.469758 0.106475 0.101663 0.990462 0.700774 0.114908 0.074222 0.778864 0.374268 0.723075 0 0 0 0.000498 0.45 1544 3222190 115849 545.443484 735.718058 0.0
46 46 1.333747 0.072344 1.34512 0.072535 653 120.612 26.850502 58 214 262 238 0 0.497274 0.81394 0.742504 0.744389 0.502042 0.11585 0.091239 0.993267 0.721473 0.105089 0.069692 0.737892 0.345833 0.743207 0 0 0 0.000498 0.46 1570 3340973 118783 520.953669 763.080214 0.0
47 47 1.329386 0.071233 1.343563 0.070125 676 125.844 33.370341 68 450 254 245 1 0.518304 0.805832 0.770566 0.734606 0.48399 0.116998 0.101534 0.987839 0.710857 0.107984 0.070642 0.765463 0.360075 0.731436 0 0 0 0.000498 0.47 1596 3460368 119395 593.072995 789.955396 0.0
48 48 1.417927 0.041017 1.39409 0.034073 22 126.474 31.189763 67 360 281 219 0 0.563907 0.790047 0.855867 0.705495 0.482078 0.139553 0.139521 0.990215 0.726657 0.10413 0.064004 0.709774 0.314083 0.741422 0 0 0 0.000498 0.8 2437 109113 109113 724.629463 35.931675 0.0
49 49 1.345396 0.029261 1.330821 0.027969 42 117.076 25.347549 59 258 256 244 0 0.535568 0.801546 0.808536 0.719903 0.485718 0.168965 0.147511 0.985622 0.677682 0.081421 0.051564 0.72988 0.335455 0.686715 0 0 0 0.000498 0.816667 2480 213371 104258 705.591564 48.274945 0.0
50 50 1.287963 0.025475 1.2919 0.025217 63 116.926 27.187213 60 231 249 251 0 0.511622 0.810156 0.779749 0.72872 0.509976 0.14422 0.136344 0.989546 0.674869 0.075818 0.048664 0.757046 0.336523 0.680968 0 0 0 0.000498 0.833333 2523 318225 104854 698.035852 71.476189 0.0

Correlations: 5 (columns that move together, r² > 0.25)
Outliers: 26 (values > 3σ from the mean)
Data Quality: 3 (missing or empty values)
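
The two thresholds quoted in the summary (r² > 0.25 for correlated columns, more than 3σ from the mean for outliers) can be checked by hand. The sketch below is one plausible reading, not the method that produced the counts: it assumes r² is the squared Pearson correlation and that σ is the population standard deviation, neither of which the page states. The function names `r_squared` and `outlier_indices` are hypothetical. The two loss lists are the loss_policy_train and loss_policy_val values for iterations 1 to 8, assuming the first token of each row is a row index and the second is the iteration.

```python
from statistics import fmean, pstdev


def r_squared(xs, ys):
    """Squared Pearson correlation between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 values")
    mx, my = fmean(xs), fmean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        return 0.0  # constant column: correlation undefined, treated as 0 here
    return (sxy * sxy) / (sxx * syy)


def outlier_indices(values, k=3.0):
    """Indices of values more than k population standard deviations from the mean."""
    mu = fmean(values)
    sigma = pstdev(values)  # assumption: population, not sample, deviation
    if sigma == 0:
        return []
    return [i for i, v in enumerate(values) if abs(v - mu) > k * sigma]


# loss_policy_train and loss_policy_val, iterations 1-8, copied from the table.
train = [2.407992, 1.82506, 1.644731, 1.582308,
         1.539906, 1.507297, 1.480466, 1.462151]
val = [2.360713, 1.863938, 1.715597, 1.646063,
       1.632733, 1.551103, 1.541228, 1.540773]

print("moves together:", r_squared(train, val) > 0.25)

# Toy list (not from the dataset) just to show the 3-sigma rule firing.
toy = [10.0] * 20 + [100.0]
print("outlier positions:", outlier_indices(toy))
```

A 3σ rule needs a reasonably long column to flag anything: with the population standard deviation, no value in a list of 10 or fewer points can sit more than 3σ from the mean, which is why the toy list above has 21 entries.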
Self-play reinforcement learning run tracking policy loss, game length, MCTS agreement, and value calibration across 177 training iterations.

This dataset contains 176 records across 41 fields: iteration, loss_policy_train, loss_value_train, loss_policy_val, loss_value_val, gradient_steps, and 35 more.
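
As a rough illustration of working with the leading fields named above, the sketch below parses the start of one data row and computes the gap between validation and training policy loss. It rests on two assumptions that the page does not confirm: the first token of each row is a row index rather than a field, and the next six tokens line up with the first six header names. `parse_leading_fields` and `policy_gap` are hypothetical helper names.

```python
# Names of the first six fields, copied from the header row.
LEADING_FIELDS = [
    "iteration", "loss_policy_train", "loss_value_train",
    "loss_policy_val", "loss_value_val", "gradient_steps",
]


def parse_leading_fields(line):
    """Map the first six named fields of one data row to numbers.

    Assumption: the first token is a row index, not a field.
    """
    tokens = line.split()
    values = tokens[1:1 + len(LEADING_FIELDS)]
    if len(values) < len(LEADING_FIELDS):
        raise ValueError("row is too short")
    row = {name: float(tok) for name, tok in zip(LEADING_FIELDS, values)}
    row["iteration"] = int(row["iteration"])
    row["gradient_steps"] = int(row["gradient_steps"])
    return row


def policy_gap(row):
    """Validation minus training policy loss (positive means validation is worse)."""
    return row["loss_policy_val"] - row["loss_policy_train"]


# Start of the iteration-50 row, truncated to the fields used here.
row = parse_leading_fields("50 50 1.287963 0.025475 1.2919 0.025217 63")
print(row["iteration"], round(policy_gap(row), 6))
```

Only the leading fields are mapped here, so the sketch does not depend on how the later columns align with the header.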