Dev Builds » 20240705-1343

Use this dev build

NCM plays each Stockfish dev build 20,000 times against Stockfish 15. This yields an approximate Elo difference and establishes confidence in the strength of the dev builds.

Summary

Host Duration Avg Base NPS Games WLD Standard Elo Ptnml(0-2) Gamepair Elo
ncm-dbt-01 06:52:27 583498 4000 1504 609 1887 +79.08 ± 5.06 1 138 836 1015 10 +165.36 ± 11.77
ncm-dbt-02 06:52:40 585403 4016 1517 614 1885 +79.48 ± 5.09 5 126 854 1007 16 +165.89 ± 11.64
ncm-dbt-03 06:52:39 587562 4002 1497 580 1925 +81.05 ± 5.04 1 130 834 1023 13 +169.39 ± 11.78
ncm-dbt-04 06:49:32 571447 4000 1479 610 1911 +76.7 ± 5.0 0 134 875 979 12 +159.15 ± 11.49
ncm-dbt-05 06:53:09 583874 3982 1463 621 1898 +74.59 ± 4.87 0 122 910 954 5 +155.71 ± 11.22
20000 7460 3034 9506 +78.18 ± 2.24 7 650 4309 4978 56 +163.08 ± 5.18

Test Detail

ID Host Base NPS Games WLD Standard Elo Ptnml(0-2) Gamepair Elo CLI PGN
391184 ncm-dbt-03 589368 2 1 0 1 +189.7 ± 55.98 0 0 0 1 0 +1129.65 ± 376.02
391183 ncm-dbt-02 585169 16 5 1 10 +88.68 ± 65.24 0 0 4 4 0 +190.76 ± 189.19
391182 ncm-dbt-05 584832 482 176 75 231 +73.9 ± 14.54 0 18 105 117 1 +153.41 ± 33.34
391181 ncm-dbt-01 582485 500 196 76 228 +85.04 ± 14.78 0 19 94 135 2 +178.11 ± 35.3
391180 ncm-dbt-04 571672 500 192 74 234 +83.57 ± 13.87 0 14 105 130 1 +176.33 ± 33.3
391179 ncm-dbt-02 584369 500 195 67 238 +90.96 ± 12.92 0 6 112 130 2 +192.71 ± 31.67
391178 ncm-dbt-03 586435 500 174 72 254 +71.88 ± 14.1 0 17 116 115 2 +147.19 ± 31.57
391177 ncm-dbt-05 582277 500 186 75 239 +78.43 ± 14.46 0 18 105 125 2 +162.35 ± 33.37
391176 ncm-dbt-01 583614 500 188 67 245 +85.78 ± 13.61 1 8 112 127 2 +181.7 ± 31.89
391175 ncm-dbt-04 570709 500 179 71 250 +76.25 ± 14.25 0 16 113 118 3 +155.54 ± 32.01
391174 ncm-dbt-02 582402 500 192 94 214 +68.99 ± 14.97 1 20 112 114 3 +140.62 ± 32.26
391173 ncm-dbt-03 586393 500 188 64 248 +88.0 ± 14.99 0 20 88 140 2 +185.33 ± 36.42
391172 ncm-dbt-05 582945 500 173 73 254 +70.43 ± 14.2 0 19 113 117 1 +145.54 ± 32.08
391171 ncm-dbt-04 572517 500 191 63 246 +90.97 ± 13.56 0 12 98 140 0 +196.45 ± 34.53
391170 ncm-dbt-01 582235 500 182 81 237 +71.16 ± 14.64 0 22 106 121 1 +147.19 ± 33.22
391169 ncm-dbt-02 587664 500 175 78 247 +68.27 ± 14.67 1 21 108 120 0 +143.89 ± 32.9
391168 ncm-dbt-03 587155 500 194 75 231 +84.3 ± 14.18 0 14 106 127 3 +174.55 ± 33.13
391167 ncm-dbt-04 572800 500 192 85 223 +75.52 ± 14.23 0 17 111 120 2 +155.54 ± 32.36
391166 ncm-dbt-05 582736 500 183 86 231 +68.27 ± 13.39 0 15 123 112 0 +142.26 ± 30.43
391165 ncm-dbt-01 583363 500 196 93 211 +72.61 ± 14.55 0 20 109 119 2 +148.85 ± 32.73
391164 ncm-dbt-02 582945 500 200 65 235 +96.19 ± 14.26 0 12 95 139 4 +202.15 ± 35.11
391163 ncm-dbt-03 587834 500 188 73 239 +81.36 ± 14.11 0 16 104 129 1 +171.02 ± 33.51
391162 ncm-dbt-04 569629 500 185 78 237 +75.52 ± 14.08 0 15 116 116 3 +153.86 ± 31.51
391161 ncm-dbt-05 582903 500 182 76 242 +74.79 ± 14.06 0 17 111 121 1 +155.54 ± 32.36
391160 ncm-dbt-01 584664 500 182 64 254 +83.57 ± 14.16 0 17 98 135 0 +178.11 ± 34.58
391159 ncm-dbt-03 588771 500 198 68 234 +92.45 ± 13.74 0 12 97 140 1 +198.34 ± 34.72
391158 ncm-dbt-02 587664 500 186 73 241 +79.9 ± 14.5 0 18 103 127 2 +165.8 ± 33.7
391157 ncm-dbt-04 570749 500 179 90 231 +62.51 ± 14.17 0 22 117 111 0 +129.35 ± 31.52
391156 ncm-dbt-01 582068 500 192 78 230 +80.63 ± 13.79 0 14 109 126 1 +169.27 ± 32.62
391155 ncm-dbt-05 583908 500 187 84 229 +72.61 ± 12.61 0 9 129 112 0 +152.18 ± 29.18
391154 ncm-dbt-03 587792 500 189 72 239 +82.83 ± 14.14 0 15 105 128 2 +172.78 ± 33.32
391153 ncm-dbt-02 587494 500 189 84 227 +74.06 ± 14.88 1 19 106 122 2 +153.86 ± 33.21
391152 ncm-dbt-04 572356 500 187 72 241 +81.36 ± 14.25 0 16 105 127 2 +169.27 ± 33.34
391151 ncm-dbt-01 582527 500 190 83 227 +75.52 ± 14.51 0 19 107 122 2 +155.54 ± 33.04
391150 ncm-dbt-05 586223 500 189 85 226 +73.34 ± 13.86 0 17 112 121 0 +153.86 ± 32.2
391149 ncm-dbt-02 585464 500 194 78 228 +82.1 ± 14.13 0 14 109 124 3 +169.27 ± 32.62
391148 ncm-dbt-03 586223 500 182 77 241 +74.06 ± 14.88 0 22 103 123 2 +152.18 ± 33.71
391147 ncm-dbt-04 571150 500 174 77 249 +68.27 ± 14.54 0 22 110 117 1 +140.62 ± 32.58
391146 ncm-dbt-01 587028 500 178 67 255 +78.44 ± 14.31 0 19 101 130 0 +165.8 ± 34.05
391145 ncm-dbt-03 588089 500 183 79 238 +73.34 ± 13.86 1 14 115 120 0 +155.54 ± 31.67
391144 ncm-dbt-02 585464 500 181 74 245 +75.52 ± 14.65 2 16 105 127 0 +162.35 ± 33.37
391143 ncm-dbt-05 585169 500 187 67 246 +85.04 ± 12.97 0 9 112 129 0 +181.7 ± 31.89

Commit

Commit ID 2cbc20e846e46da8bfc8e254a7703a0bfad3b850
Author Joost VandeVondele
Date 2024-07-05 13:43:49 UTC
Correct and extend PV lines with decisive TB score Currently (after #5407), SF has the property that any PV line with a decisive TB score contains the corresponding TB position, with a score that correctly identifies the depth at which TB are entered. The PV line that follows might not preserve the game outcome, but can easily be verified and extended based on TB information. This patch provides this functionality, simply extending the PV lines on output, this doesn't affect search. Indeed, if DTZ tables are available, search based PV lines that correspond to decisive TB scores are verified to preserve game outcome, truncating the line as needed. Subsequently, such PV lines are extended with a game outcome preserving line until mate, as a possible continuation. These lines are not optimal mating lines, but are similar to what a user could produce on a website like https://syzygy-tables.info/ clicking always the top ranked move, i.e. minimizing or maximizing DTZ (with a simple tie-breaker for moves that have identical DTZ), and are thus an just an illustration of how to game can be won. A similar approach is already in established in https://github.com/joergoster/Stockfish/tree/matefish2 This also contributes to addressing #5175 where SF can give an incorrect TB win/loss for positions in TB with a movecounter that doesn't reflect optimal play. While the full solution requires either TB generated differently, or a search when ranking rootmoves, current SF will eventually find a draw in these cases, in practice quite quickly, e.g. `1kq5/q2r4/5K2/8/8/8/8/7Q w - - 96 1` `8/8/6k1/3B4/3K4/4N3/8/8 w - - 54 106` Gives the same results as master on an extended set of test positions from https://github.com/mcostalba/Stockfish/commit/9173d29c414ddb8f4bec74e4db3ccbe664c66bf9 with the exception of the above mentioned fen where this commit improves. With https://github.com/vondele/matetrack using 6men TB, all generated PVs verify: ``` Using ../Stockfish/src/stockfish.syzygyExtend on matetrack.epd with --nodes 1000000 --syzygyPath /chess/syzygy/3-4-5-6/WDL:/chess/syzygy/3-4-5-6/DTZ Engine ID: Stockfish dev-20240704-ff227954 Total FENs: 6555 Found mates: 3299 Best mates: 2582 Found TB wins: 568 ``` As repeated DTZ probing could be slow a procedure (100ms+ on HDD, a few ms on SSD), the extension is only done as long as the time taken is less than half the `Move Overhead` parameter. For tournaments where these lines might be of interest to the user, a suitable `Move Overhead` might be needed (e.g. TCEC has 1000ms already). closes https://github.com/official-stockfish/Stockfish/pull/5414 No functional change
Copyright 2011–2024 Next Chess Move LLC