Dev Builds » 20240307-1855

You are viewing an old NCM Stockfish dev build test. You may find the most recent dev build tests using Stockfish 15 as the baseline here.

Use this dev build

NCM plays each Stockfish dev build 20,000 times against Stockfish 14. This yields an approximate Elo difference and establishes confidence in the strength of the dev builds.

Summary

Host Duration Avg Base NPS Games WLD Standard Elo Ptnml(0-2) Gamepair Elo
ncm-dbt-01 11:32:19 1199302 4000 1752 296 1952 +132.54 ± 4.69 0 31 538 1375 56 +301.33 ± 14.68
ncm-dbt-02 11:31:19 1239936 3982 1770 293 1919 +135.32 ± 4.78 0 39 497 1394 61 +309.08 ± 15.3
ncm-dbt-03 11:35:26 1234708 4010 1740 299 1971 +130.68 ± 4.8 1 43 529 1378 54 +296.12 ± 14.82
ncm-dbt-05 11:31:28 1230045 4000 1772 300 1928 +134.15 ± 4.79 1 42 498 1402 57 +306.84 ± 15.28
ncm-dbt-06 11:35:36 1232211 4008 1755 313 1940 +130.86 ± 4.81 1 48 513 1392 50 +298.02 ± 15.05
20000 8789 1501 9710 +132.7 ± 2.13 3 203 2575 6941 278 +302.22 ± 6.71

Test Detail

ID Host Base NPS Games WLD Standard Elo Ptnml(0-2) Gamepair Elo CLI PGN
340057 ncm-dbt-06 1255808 8 3 1 4 +88.62 ± 93.84 0 0 2 2 0 +190.67 ± 458.56
340056 ncm-dbt-03 1252598 10 3 0 7 +107.43 ± 84.35 0 0 2 3 0 +240.65 ± 570.11
340055 ncm-dbt-02 1234381 482 221 34 227 +142.24 ± 13.09 0 3 55 176 7 +335.58 ± 46.62
340054 ncm-dbt-05 1234917 500 219 35 246 +134.15 ± 14.15 0 6 64 170 10 +298.62 ± 43.1
340053 ncm-dbt-01 1189043 500 223 33 244 +138.99 ± 12.3 0 2 61 182 5 +330.23 ± 44.08
340052 ncm-dbt-03 1237212 500 221 31 248 +138.99 ± 12.5 0 2 62 180 6 +327.18 ± 43.69
340051 ncm-dbt-06 1239722 500 220 32 248 +137.37 ± 13.34 1 5 53 187 4 +330.23 ± 47.44
340050 ncm-dbt-02 1236584 500 229 37 234 +140.62 ± 13.04 0 4 57 182 7 +330.23 ± 45.79
340049 ncm-dbt-05 1234395 500 220 39 241 +131.74 ± 13.12 0 4 67 173 6 +301.33 ± 42.04
340048 ncm-dbt-01 1201925 500 230 29 241 +148.02 ± 13.54 0 5 49 186 10 +349.43 ± 49.46
340047 ncm-dbt-03 1223439 500 217 37 246 +130.94 ± 13.69 0 6 65 172 7 +295.94 ± 42.75
340046 ncm-dbt-06 1216449 500 222 35 243 +136.56 ± 13.73 0 7 56 180 7 +315.35 ± 46.07
340045 ncm-dbt-02 1235811 500 232 40 228 +140.62 ± 13.8 0 4 61 174 11 +318.25 ± 44.18
340044 ncm-dbt-05 1218761 500 214 32 254 +132.54 ± 14.01 0 5 68 167 10 +293.29 ± 41.75
340043 ncm-dbt-01 1189190 500 215 47 238 +121.45 ± 14.5 0 8 75 158 9 +261.07 ± 39.69
340042 ncm-dbt-03 1246225 500 226 43 231 +133.34 ± 13.45 1 7 52 188 2 +321.19 ± 47.67
340041 ncm-dbt-06 1218677 500 226 31 243 +143.07 ± 13.54 0 6 51 185 8 +336.46 ± 48.37
340040 ncm-dbt-02 1245470 500 224 34 242 +138.99 ± 13.1 0 5 56 183 6 +327.18 ± 46.19
340039 ncm-dbt-05 1216650 500 221 31 248 +138.99 ± 13.85 0 5 60 175 10 +315.35 ± 44.57
340038 ncm-dbt-03 1220521 500 218 39 243 +130.14 ± 14.06 0 4 74 161 11 +282.94 ± 39.86
340037 ncm-dbt-01 1198145 500 206 29 265 +128.55 ± 13.73 0 5 71 166 8 +285.49 ± 40.81
340036 ncm-dbt-06 1225301 500 204 40 256 +118.33 ± 14.03 0 11 67 169 3 +265.78 ± 41.9
340035 ncm-dbt-05 1229574 500 218 34 248 +134.15 ± 12.86 0 3 66 175 6 +309.64 ± 42.33
340034 ncm-dbt-02 1216856 500 221 33 246 +137.37 ± 13.71 0 6 58 178 8 +315.35 ± 45.33
340033 ncm-dbt-03 1245327 500 213 41 246 +124.6 ± 14.47 0 8 71 162 9 +270.57 ± 40.83
340032 ncm-dbt-01 1200132 500 217 46 237 +123.81 ± 12.92 0 4 75 167 4 +280.42 ± 39.57
340031 ncm-dbt-06 1227124 500 219 36 245 +133.34 ± 13.26 0 4 66 173 7 +304.07 ± 42.38
340030 ncm-dbt-05 1236864 500 223 43 234 +130.94 ± 12.76 0 4 66 176 4 +304.07 ± 42.38
340029 ncm-dbt-02 1239980 500 222 36 242 +135.76 ± 12.81 0 1 70 171 8 +309.64 ± 40.79
340028 ncm-dbt-03 1234635 500 210 32 258 +129.35 ± 13.9 0 9 59 177 5 +295.94 ± 44.72
340027 ncm-dbt-01 1202481 500 223 37 240 +135.76 ± 13.2 0 2 69 170 9 +306.84 ± 41.23
340026 ncm-dbt-06 1236245 500 217 47 236 +123.02 ± 14.33 0 10 66 168 6 +273.0 ± 42.27
340025 ncm-dbt-02 1245608 500 206 54 240 +109.07 ± 14.68 0 11 83 149 7 +230.16 ± 37.67
340024 ncm-dbt-05 1248252 500 230 47 223 +133.34 ± 14.34 1 9 51 184 5 +312.48 ± 47.79
340023 ncm-dbt-01 1194361 500 223 36 241 +136.56 ± 12.98 0 3 64 176 7 +315.35 ± 43.03
340022 ncm-dbt-03 1228991 500 220 49 231 +123.81 ± 13.46 0 4 78 161 7 +273.0 ± 38.74
340021 ncm-dbt-06 1224665 500 217 44 239 +125.38 ± 12.51 0 1 80 164 5 +282.94 ± 37.88
340020 ncm-dbt-05 1220954 500 227 39 234 +137.37 ± 13.34 0 6 56 182 6 +321.19 ± 46.14
340019 ncm-dbt-01 1219143 500 215 39 246 +127.76 ± 12.45 0 2 74 170 4 +293.29 ± 39.69
340018 ncm-dbt-02 1264800 500 215 25 260 +138.99 ± 13.29 0 5 57 181 7 +324.17 ± 45.77
340017 ncm-dbt-03 1223429 500 212 27 261 +134.95 ± 13.03 0 3 66 174 7 +309.64 ± 42.33
340016 ncm-dbt-06 1245915 500 227 47 226 +130.94 ± 13.87 0 4 72 164 10 +288.06 ± 40.46

Commit

Commit ID 1db969e6200afe4f023469a56aa5edf755d92bbb
Author rn5f107s2
Date 2024-03-07 18:55:51 UTC
Reduce futility_margin if opponents last move was bad This reduces the futiltiy_margin if our opponents last move was bad by around ~1/3 when not improving and ~1/2.7 when improving, the idea being to retroactively futility prune moves that were played, but turned out to be bad. A bad move is being defined as their staticEval before their move being lower as our staticEval now is. If the depth is 2 and we are improving the opponent worsening flag is not set, in order to not risk having a too low futility_margin, due to the fact that when these conditions are met the futility_margin already drops quite low. Passed STC: https://tests.stockfishchess.org/tests/live_elo/65e3977bf2ef6c733362aae3 LLR: 2.94 (-2.94,2.94) <0.00,2.00> Total: 122432 W: 31884 L: 31436 D: 59112 Ptnml(0-2): 467, 14404, 31035, 14834, 476 Passed LTC: https://tests.stockfishchess.org/tests/live_elo/65e47f40f2ef6c733362b6d2 LLR: 2.94 (-2.94,2.94) <0.50,2.50> Total: 421692 W: 106572 L: 105452 D: 209668 Ptnml(0-2): 216, 47217, 114865, 48327, 221 closes https://github.com/official-stockfish/Stockfish/pull/5092 Bench: 1565939
Copyright 2011–2024 Next Chess Move LLC