Dev Builds » 20231231-1900

You are viewing an old NCM Stockfish dev build test. You may find the most recent dev build tests using Stockfish 15 as the baseline here.

Use this dev build

NCM plays each Stockfish dev build 20,000 times against Stockfish 14. This yields an approximate Elo difference and establishes confidence in the strength of the dev builds.

Summary

Host Duration Avg Base NPS Games WLD Standard Elo Ptnml(0-2) Gamepair Elo
ncm-dbt-01 10:03:08 1222271 3414 1486 249 1679 +131.88 ± 5.16 2 28 456 1173 48 +299.78 ± 15.96
ncm-dbt-02 09:58:40 1235543 3420 1501 283 1636 +129.41 ± 5.19 0 32 480 1146 52 +289.28 ± 15.55
ncm-dbt-03 10:02:00 1237791 3420 1481 304 1635 +124.66 ± 5.28 1 42 491 1131 45 +277.04 ± 15.38
ncm-dbt-04 09:59:52 1225863 3418 1490 290 1638 +127.4 ± 5.16 1 34 480 1152 42 +286.89 ± 15.55
ncm-dbt-05 08:33:05 1231462 2904 1243 207 1454 +129.65 ± 5.64 0 27 407 973 45 +289.69 ± 16.89
ncm-dbt-06 10:06:21 1232265 3424 1493 247 1684 +132.5 ± 5.18 1 31 453 1175 52 +299.98 ± 16.02
20000 8694 1580 9726 +129.23 ± 2.15 5 194 2767 6750 284 +290.34 ± 6.47

Test Detail

ID Host Base NPS Games WLD Standard Elo Ptnml(0-2) Gamepair Elo CLI PGN
243520 ncm-dbt-05 1230440 404 162 22 220 +125.6 ± 14.47 0 4 57 138 3 +286.91 ± 45.67
243519 ncm-dbt-02 1236021 420 173 29 218 +124.15 ± 13.42 0 2 64 142 2 +285.61 ± 42.74
243518 ncm-dbt-04 1239353 418 181 31 206 +130.48 ± 14.78 0 3 60 139 7 +290.8 ± 44.39
243517 ncm-dbt-06 1223812 424 187 39 198 +126.59 ± 15.17 0 4 64 136 8 +275.68 ± 42.95
243516 ncm-dbt-01 1238647 414 187 30 197 +138.68 ± 14.12 0 1 55 144 7 +318.72 ± 46.28
243515 ncm-dbt-03 1263773 420 176 32 212 +124.15 ± 14.39 0 5 59 143 3 +282.58 ± 44.9
243471 ncm-dbt-06 1230959 500 217 33 250 +134.15 ± 12.86 0 4 63 178 5 +312.48 ± 43.44
243470 ncm-dbt-03 1231724 500 224 53 223 +123.81 ± 15.13 1 9 67 164 9 +270.57 ± 41.96
243469 ncm-dbt-02 1226501 500 229 51 220 +129.35 ± 13.54 0 3 75 163 9 +285.49 ± 39.5
243468 ncm-dbt-01 1208958 500 212 38 250 +126.18 ± 13.42 1 3 72 169 5 +288.06 ± 40.46
243467 ncm-dbt-04 1223575 500 218 44 238 +126.18 ± 13.6 1 4 70 170 5 +288.06 ± 41.11
243466 ncm-dbt-06 1237197 500 223 43 234 +130.94 ± 13.32 0 5 66 173 6 +298.62 ± 42.41
243465 ncm-dbt-05 1235895 500 223 25 252 +145.54 ± 13.45 0 4 54 182 10 +339.63 ± 47.1
243464 ncm-dbt-03 1256602 500 211 42 247 +122.24 ± 14.0 0 5 80 156 9 +263.42 ± 38.27
243463 ncm-dbt-01 1180788 500 217 37 246 +130.94 ± 12.95 0 4 67 174 5 +301.33 ± 42.04
243462 ncm-dbt-02 1248060 500 220 37 243 +133.34 ± 13.99 0 4 70 165 11 +293.29 ± 41.07
243461 ncm-dbt-04 1208740 500 220 47 233 +125.38 ± 13.07 0 5 71 170 4 +285.49 ± 40.81
243460 ncm-dbt-05 1212542 500 217 41 242 +127.76 ± 14.27 0 6 72 162 10 +277.93 ± 40.53
243459 ncm-dbt-06 1214885 500 222 29 249 +141.44 ± 14.15 0 3 65 168 14 +312.48 ± 42.67
243458 ncm-dbt-03 1222967 500 222 47 231 +126.97 ± 13.58 0 4 75 163 8 +280.42 ± 39.57
243457 ncm-dbt-04 1224195 500 219 43 238 +127.76 ± 14.1 0 9 62 173 6 +288.06 ± 43.65
243456 ncm-dbt-02 1222172 500 221 34 245 +136.56 ± 13.17 0 3 65 174 8 +312.48 ± 42.67
243455 ncm-dbt-01 1237074 500 225 34 241 +139.81 ± 13.64 1 3 58 180 8 +327.18 ± 45.37
243454 ncm-dbt-05 1232446 500 217 42 241 +126.97 ± 13.04 0 3 75 166 6 +285.49 ± 39.5
243453 ncm-dbt-03 1227196 500 209 47 244 +116.77 ± 13.71 0 7 79 159 5 +256.44 ± 38.62
243452 ncm-dbt-06 1244937 500 218 29 253 +138.18 ± 12.33 0 2 62 181 5 +327.18 ± 43.69
243451 ncm-dbt-04 1233235 500 215 43 242 +124.6 ± 13.09 0 5 72 169 4 +282.94 ± 40.5
243450 ncm-dbt-01 1237616 500 212 47 241 +119.11 ± 14.52 0 7 81 152 10 +251.89 ± 38.11
243449 ncm-dbt-02 1237032 500 221 34 245 +136.56 ± 12.98 0 1 70 170 9 +309.64 ± 40.79
243448 ncm-dbt-05 1229103 500 212 41 247 +123.81 ± 14.48 0 9 69 164 8 +270.57 ± 41.4
243447 ncm-dbt-03 1236366 500 219 43 238 +127.76 ± 13.39 0 7 64 175 4 +293.29 ± 43.07
243446 ncm-dbt-06 1249060 500 212 31 257 +131.74 ± 14.88 1 8 59 173 9 +295.94 ± 44.72
243445 ncm-dbt-04 1211475 500 214 45 241 +122.24 ± 13.66 0 3 84 154 9 +263.42 ± 37.11
243444 ncm-dbt-01 1252154 500 213 27 260 +135.76 ± 14.12 0 7 59 175 9 +306.84 ± 44.88
243443 ncm-dbt-02 1230067 500 216 43 241 +125.38 ± 13.96 0 9 64 172 5 +282.94 ± 42.98
243442 ncm-dbt-05 1248348 500 212 36 252 +127.76 ± 13.02 0 1 80 161 8 +282.94 ± 37.88
243441 ncm-dbt-02 1248953 500 221 55 224 +119.89 ± 14.68 0 10 72 160 8 +258.75 ± 40.49
243440 ncm-dbt-04 1240468 500 223 37 240 +135.76 ± 13.39 0 5 61 177 7 +312.48 ± 44.19
243439 ncm-dbt-03 1225912 500 220 40 240 +130.94 ± 13.51 0 5 67 171 7 +295.94 ± 42.08
243438 ncm-dbt-01 1200664 500 220 36 244 +134.15 ± 12.47 0 3 64 179 4 +315.35 ± 43.03
243437 ncm-dbt-06 1225011 500 214 43 243 +123.81 ± 13.28 0 5 74 166 5 +277.93 ± 39.92

Commit

Commit ID b4d995d0d910044cf4ea2ad3ee30fd1d21070cd8
Author Michael Chaly
Date 2023-12-31 19:00:06 UTC
Introduce static evaluation correction history Idea from Caissa (https://github.com/Witek902/Caissa) chess engine. With given pawn structure collect data with how often search result and by how much it was better / worse than static evalution of position and use it to adjust static evaluation of positions with given pawn structure. Details: 1. excludes positions with fail highs and moves producing it being a capture; 2. update value is function of not only difference between best value and static evaluation but also is multiplied by linear function of depth; 3. maximum update value is maximum value of correction history divided by 2; 4. correction history itself is divided by 32 when applied so maximum value of static evaluation adjustment is 32 internal units. Passed STC: https://tests.stockfishchess.org/tests/view/658fc7b679aa8af82b955cac LLR: 2.96 (-2.94,2.94) <0.00,2.00> Total: 128672 W: 32757 L: 32299 D: 63616 Ptnml(0-2): 441, 15241, 32543, 15641, 470 Passed LTC: https://tests.stockfishchess.org/tests/view/65903f6979aa8af82b9566f1 LLR: 2.95 (-2.94,2.94) <0.50,2.50> Total: 97422 W: 24626 L: 24178 D: 48618 Ptnml(0-2): 41, 10837, 26527, 11245, 61 closes https://github.com/official-stockfish/Stockfish/pull/4950 Bench: 1157852
Copyright 2011–2024 Next Chess Move LLC