Dev Builds » 20220917-0713

Use this dev build

NCM plays each Stockfish dev build 20,000 times against Stockfish 15. This yields an approximate Elo difference and establishes confidence in the strength of the dev builds.

Summary

Host Duration Avg Base NPS Games WLD Standard Elo Ptnml(0-2) Gamepair Elo
ncm-dbt-01 06:51:08 582987 4000 1133 935 1932 +17.21 ± 5.16 6 348 1090 554 2 +35.21 ± 10.26
ncm-dbt-02 06:51:47 585231 4006 1145 963 1898 +15.8 ± 5.07 3 347 1123 525 5 +31.31 ± 10.08
ncm-dbt-03 06:51:39 586025 4000 1109 923 1968 +16.17 ± 5.15 6 349 1102 539 4 +32.76 ± 10.2
ncm-dbt-04 06:51:09 569917 4000 1118 918 1964 +17.39 ± 5.24 7 358 1066 566 3 +35.56 ± 10.4
ncm-dbt-05 06:52:38 583603 3994 1111 927 1956 +16.02 ± 5.21 3 368 1072 550 4 +31.93 ± 10.37
20000 5616 4666 9718 +16.52 ± 2.31 25 1770 5453 2734 18 +33.35 ± 4.59

Test Detail

ID Host Base NPS Games WLD Standard Elo Ptnml(0-2) Gamepair Elo CLI PGN
415413 ncm-dbt-02 587197 6 2 3 1 -58.33 ± 98.71 0 1 2 0 0 -120.24 ± 264.77
415412 ncm-dbt-05 585759 494 137 116 241 +14.78 ± 14.96 0 48 131 67 1 +28.19 ± 29.75
415411 ncm-dbt-01 582652 500 151 122 227 +20.18 ± 13.78 1 35 148 66 0 +41.89 ± 27.47
415410 ncm-dbt-03 585379 500 136 116 248 +13.9 ± 14.34 0 45 141 63 1 +26.46 ± 28.48
415409 ncm-dbt-04 572356 500 143 119 238 +16.69 ± 14.56 0 46 134 70 0 +33.46 ± 29.39
415408 ncm-dbt-02 586858 500 150 114 236 +25.06 ± 13.75 0 34 147 68 1 +48.96 ± 27.58
415407 ncm-dbt-05 582277 500 136 124 240 +8.34 ± 14.65 0 52 134 64 0 +16.69 ± 29.4
415406 ncm-dbt-01 581985 500 142 115 243 +18.78 ± 14.21 1 39 142 68 0 +39.08 ± 28.31
415405 ncm-dbt-03 586181 500 143 131 226 +8.34 ± 13.46 1 40 155 54 0 +18.08 ± 26.58
415404 ncm-dbt-04 570629 500 136 115 249 +14.6 ± 14.78 1 45 137 66 1 +29.25 ± 29.0
415403 ncm-dbt-02 584453 500 153 130 217 +15.99 ± 13.99 0 42 143 65 0 +32.05 ± 28.2
415402 ncm-dbt-03 584369 500 125 109 266 +11.12 ± 15.49 1 54 123 72 0 +23.66 ± 30.78
415401 ncm-dbt-04 569749 500 159 122 219 +25.76 ± 14.6 0 41 131 78 0 +51.8 ± 29.75
415400 ncm-dbt-05 586731 500 130 135 235 -3.47 ± 14.86 2 56 137 55 0 -4.17 ± 29.02
415399 ncm-dbt-01 583489 500 147 116 237 +21.57 ± 14.68 0 44 131 75 0 +43.3 ± 29.76
415398 ncm-dbt-02 587494 500 138 130 232 +5.56 ± 14.28 0 51 140 59 0 +11.12 ± 28.63
415397 ncm-dbt-03 588217 500 142 117 241 +17.38 ± 15.0 0 47 133 68 2 +32.05 ± 29.52
415396 ncm-dbt-04 570989 500 134 110 256 +16.69 ± 14.56 0 45 137 67 1 +32.05 ± 29.0
415395 ncm-dbt-01 583070 500 137 115 248 +15.3 ± 14.19 1 41 143 65 0 +32.05 ± 28.2
415394 ncm-dbt-05 583531 500 132 117 251 +10.43 ± 14.82 0 52 131 67 0 +20.87 ± 29.78
415393 ncm-dbt-02 581277 500 133 115 252 +12.51 ± 15.12 1 49 132 67 1 +25.06 ± 29.65
415392 ncm-dbt-03 584243 500 136 113 251 +15.99 ± 14.89 1 45 135 68 1 +32.05 ± 29.26
415391 ncm-dbt-04 569470 500 133 116 251 +11.82 ± 15.67 0 58 117 75 0 +23.66 ± 31.51
415390 ncm-dbt-01 581902 500 131 104 265 +18.78 ± 14.98 1 44 133 71 1 +37.67 ± 29.51
415389 ncm-dbt-05 581860 500 140 114 246 +18.08 ± 15.05 1 46 129 74 0 +37.67 ± 30.03
415388 ncm-dbt-02 586562 500 136 110 254 +18.08 ± 15.3 0 49 128 71 2 +33.46 ± 30.16
415387 ncm-dbt-01 583908 500 147 129 224 +12.51 ± 14.99 1 48 134 66 1 +25.06 ± 29.4
415386 ncm-dbt-04 567363 500 142 100 258 +29.25 ± 14.71 1 37 131 81 0 +60.36 ± 29.73
415385 ncm-dbt-03 586562 500 139 107 254 +22.27 ± 13.81 0 37 144 69 0 +44.72 ± 28.02
415384 ncm-dbt-05 583279 500 137 103 260 +23.66 ± 14.32 0 38 142 68 2 +44.72 ± 28.29
415383 ncm-dbt-02 585970 500 144 124 232 +13.9 ± 14.21 0 44 143 62 1 +26.46 ± 28.21
415382 ncm-dbt-01 583489 500 141 122 237 +13.21 ± 14.8 1 47 134 68 0 +27.85 ± 29.39
415381 ncm-dbt-04 568275 500 133 123 244 +6.95 ± 15.28 3 48 136 62 1 +16.69 ± 29.14
415380 ncm-dbt-03 586689 500 148 110 242 +26.46 ± 14.13 1 34 141 74 0 +54.65 ± 28.39
415379 ncm-dbt-05 581777 500 146 100 254 +32.05 ± 14.38 0 36 132 82 0 +64.66 ± 29.59
415378 ncm-dbt-02 583740 500 155 121 224 +23.66 ± 14.19 1 36 141 72 0 +48.96 ± 28.41
415377 ncm-dbt-01 583405 500 137 112 251 +17.39 ± 15.12 0 50 125 75 0 +34.86 ± 30.53
415376 ncm-dbt-04 570509 500 138 113 249 +17.39 ± 14.36 2 38 143 67 0 +37.67 ± 28.18
415375 ncm-dbt-03 586562 500 140 120 240 +13.91 ± 15.22 2 47 130 71 0 +30.65 ± 29.9
415374 ncm-dbt-02 583531 500 134 116 250 +12.51 ± 13.96 1 41 147 61 0 +26.46 ± 27.67
415373 ncm-dbt-05 583614 500 153 118 229 +24.36 ± 14.5 0 40 136 73 1 +47.55 ± 29.1

Commit

Commit ID 154e7afed0fe9c6f45a2aee8ef6f38d44076cb19
Author atumanian
Date 2022-09-17 07:13:07 UTC
Simplify trend and optimism. This patch simplifies the formulas used to compute the trend and optimism values before each search iteration. As a side effect, this removes the parameters which make the relationship between the displayed evaluation value and the expected game result asymmetric. I've also provided links to the results of isotonic regression analysis of the relationship between the evaluation and game result (statistical data and a graph) for both tests, which demonstrate that the new version has a more symmetric relationship: STC: [Data and graph](https://github.com/official-stockfish/Stockfish/discussions/4150#discussioncomment-3548954) LTC: [Data and graph](https://github.com/official-stockfish/Stockfish/discussions/4150#discussioncomment-3626311) See also https://github.com/official-stockfish/Stockfish/issues/4142 passed STC: https://tests.stockfishchess.org/tests/view/6313f44b8202a039920e27e6 LLR: 2.96 (-2.94,2.94) <-1.75,0.25> Total: 108016 W: 28903 L: 28760 D: 50353 Ptnml(0-2): 461, 12075, 28850, 12104, 518 passed LTC: https://tests.stockfishchess.org/tests/view/631de45db85daa436625dfe6 LLR: 3.01 (-2.94,2.94) <-1.75,0.25> Total: 34792 W: 9412 L: 9209 D: 16171 Ptnml(0-2): 24, 3374, 10397, 3577, 24 Furthermore, this does not measurably impact Elo strength against weaker engines, as demonstrated in a match of master and patch vs SF13: This patch vs SF 13: https://tests.stockfishchess.org/tests/view/631fa34ae1612778c344c6eb Elo: 141.66 +-1.2 (95%) LOS: 100.0% Total: 100000 W: 48182 L: 9528 D: 42290 Ptnml(0-2): 96, 1426, 13277, 30130, 5071 nElo: 284.13 +-3.3 (95%) PairsRatio: 23.13 Master vs SF 13: https://tests.stockfishchess.org/tests/view/631fa3ece1612778c344c6ff Elo: 143.26 +-1.2 (95%) LOS: 100.0% Total: 100000 W: 48525 L: 9479 D: 41996 Ptnml(0-2): 94, 1537, 13098, 29771, 5500 nElo: 281.70 +-3.3 (95%) PairsRatio: 21.63 closes: https://github.com/official-stockfish/Stockfish/pull/4163 Bench: 4425574
Copyright 2011–2025 Next Chess Move LLC