Dev Builds » 20230122-0954

NCM plays each Stockfish dev build 20,000 times against Stockfish 15. This yields an approximate Elo difference and establishes confidence in the strength of the dev builds.

Summary

| Host | Duration | Avg Base NPS | Games | W | L | D | Standard Elo | Ptnml(0-2) | Gamepair Elo |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| ncm-dbt-01 | 06:59:47 | 585133 | 4012 | 1179 | 831 | 2002 | +30.21 ± 5.16 | 4 294 1064 638 6 | +60.53 ± 10.4 |
| ncm-dbt-02 | 06:59:32 | 585046 | 3994 | 1172 | 861 | 1961 | +27.11 ± 5.22 | 3 316 1051 621 6 | +54.02 ± 10.47 |
| ncm-dbt-03 | 06:59:46 | 587140 | 4000 | 1211 | 880 | 1909 | +28.82 ± 5.08 | 1 297 1073 628 1 | +58.03 ± 10.34 |
| ncm-dbt-04 | 07:01:10 | 564180 | 3994 | 1150 | 830 | 2014 | +27.9 ± 5.25 | 5 315 1035 639 3 | +56.51 ± 10.56 |
| ncm-dbt-05 | 06:59:57 | 584574 | 4000 | 1203 | 833 | 1964 | +32.23 ± 5.2 | 6 288 1038 666 2 | +65.74 ± 10.54 |
| Total | | | 20000 | 5915 | 4235 | 9850 | +29.25 ± 2.32 | 19 1510 5261 3192 18 | +58.96 ± 4.68 |
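
The totals row can be reproduced from its own W/L/D and Ptnml counts. The sketch below is not NCM's published method. It assumes the standard logistic Elo formula, a normal-approximation 95% interval (z = 1.96) computed from the game-pair distribution, and a "gamepair" score in which each two-game pair counts as a single loss, draw, or win. All function and variable names are illustrative.

```python
import math


def elo_from_score(score: float) -> float:
    """Logistic Elo difference implied by an average score in (0, 1)."""
    return -400.0 * math.log10(1.0 / score - 1.0)


def elo_with_error(counts, values, z=1.96):
    """Return (elo, half_width) for outcome counts and their per-outcome scores.

    Assumption: the interval is z standard errors of the mean score,
    mapped through the Elo formula and averaged over both sides.
    """
    n = sum(counts)
    mean = sum(c * v for c, v in zip(counts, values)) / n
    var = sum(c * (v - mean) ** 2 for c, v in zip(counts, values)) / n
    delta = z * math.sqrt(var / n)
    elo = elo_from_score(mean)
    half_width = (elo_from_score(mean + delta) - elo_from_score(mean - delta)) / 2.0
    return elo, half_width


# Totals row of the Summary table.
wins, losses, draws = 5915, 4235, 9850
ptnml = [19, 1510, 5261, 3192, 18]  # game pairs scoring 0, 0.5, 1, 1.5, 2 points

# Point estimate straight from W/L/D (a draw counts as half a point).
wld_score = (wins + 0.5 * draws) / (wins + losses + draws)

# Standard Elo: per-game score of each pair outcome is 0, 0.25, 0.5, 0.75, 1.
standard_elo, standard_err = elo_with_error(ptnml, [0.0, 0.25, 0.5, 0.75, 1.0])

# Gamepair Elo (assumed definition): pairs scoring under 1 point are losses,
# exactly 1 point are draws, over 1 point are wins.
pair_counts = [ptnml[0] + ptnml[1], ptnml[2], ptnml[3] + ptnml[4]]
gamepair_elo, gamepair_err = elo_with_error(pair_counts, [0.0, 0.5, 1.0])

print(f"Standard Elo: {standard_elo:+.2f} ± {standard_err:.2f}")  # +29.25 ± 2.32
print(f"Gamepair Elo: {gamepair_elo:+.2f} ± {gamepair_err:.2f}")  # +58.96 ± 4.68
```

Under these assumptions both printed values agree with the totals row, which suggests the error bars are derived from the pair counts rather than from individual game results.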

Test Detail

| ID | Host | Base NPS | Games | W | L | D | Standard Elo | Ptnml(0-2) | Gamepair Elo |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 378954 | ncm-dbt-01 | 585717 | 12 | 4 | 2 | 6 | +58.39 ± 68.56 | 0 0 4 2 0 | +120.33 ± 161.99 |
| 378953 | ncm-dbt-04 | 569270 | 106 | 26 | 23 | 57 | +9.84 ± 29.42 | 0 9 32 12 0 | +19.69 ± 59.41 |
| 378952 | ncm-dbt-02 | 583154 | 494 | 156 | 95 | 243 | +43.12 ± 14.55 | 1 28 127 91 0 | +89.11 ± 30.16 |
| 378951 | ncm-dbt-05 | 586393 | 500 | 149 | 100 | 251 | +34.16 ± 14.78 | 0 37 128 84 1 | +67.55 ± 30.12 |
| 378950 | ncm-dbt-03 | 588345 | 500 | 155 | 107 | 238 | +33.46 ± 14.48 | 0 36 130 84 0 | +67.55 ± 29.85 |
| 378949 | ncm-dbt-01 | 584537 | 500 | 153 | 108 | 239 | +31.35 ± 14.47 | 0 37 131 82 0 | +63.23 ± 29.72 |
| 378948 | ncm-dbt-04 | 570549 | 500 | 142 | 108 | 250 | +23.66 ± 14.83 | 1 41 131 77 0 | +48.96 ± 29.75 |
| 378947 | ncm-dbt-02 | 583950 | 500 | 153 | 90 | 257 | +44.01 ± 14.47 | 0 30 128 91 1 | +88.0 ± 30.06 |
| 378946 | ncm-dbt-03 | 586308 | 500 | 150 | 107 | 243 | +29.95 ± 15.14 | 1 40 124 85 0 | +61.79 ± 30.65 |
| 378945 | ncm-dbt-05 | 584411 | 500 | 155 | 99 | 246 | +39.08 ± 15.21 | 1 36 119 94 0 | +80.63 ± 31.31 |
| 378944 | ncm-dbt-01 | 585253 | 500 | 129 | 111 | 260 | +12.51 ± 14.61 | 1 46 137 66 0 | +26.46 ± 29.0 |
| 378943 | ncm-dbt-04 | 512142 | 500 | 146 | 96 | 258 | +34.86 ± 15.08 | 0 39 123 87 1 | +68.99 ± 30.78 |
| 378942 | ncm-dbt-02 | 586097 | 500 | 138 | 114 | 248 | +16.69 ± 13.77 | 0 39 149 61 1 | +32.05 ± 27.38 |
| 378941 | ncm-dbt-03 | 587197 | 500 | 141 | 108 | 251 | +22.96 ± 14.0 | 0 38 141 71 0 | +46.13 ± 28.42 |
| 378940 | ncm-dbt-05 | 584034 | 500 | 158 | 100 | 242 | +40.49 ± 14.65 | 1 31 127 91 0 | +83.57 ± 30.21 |
| 378939 | ncm-dbt-04 | 572276 | 388 | 111 | 84 | 193 | +24.22 ± 16.49 | 0 32 103 59 0 | +48.67 ± 33.54 |
| 378938 | ncm-dbt-01 | 585042 | 500 | 147 | 104 | 249 | +29.95 ± 14.37 | 0 35 139 74 2 | +57.5 ± 28.65 |
| 378937 | ncm-dbt-02 | 588004 | 500 | 141 | 109 | 250 | +22.26 ± 13.95 | 0 37 145 67 1 | +43.3 ± 27.89 |
| 378936 | ncm-dbt-03 | 587028 | 500 | 152 | 107 | 241 | +31.35 ± 13.94 | 0 33 139 78 0 | +63.23 ± 28.63 |
| 378935 | ncm-dbt-05 | 585843 | 500 | 153 | 106 | 241 | +32.75 ± 13.9 | 0 31 142 76 1 | +64.66 ± 28.2 |
| 378934 | ncm-dbt-01 | 585084 | 500 | 142 | 94 | 264 | +33.46 ± 14.21 | 1 30 140 78 1 | +67.55 ± 28.46 |
| 378933 | ncm-dbt-04 | 571632 | 500 | 137 | 111 | 252 | +18.08 ± 14.92 | 2 41 137 69 1 | +37.67 ± 28.99 |
| 378932 | ncm-dbt-02 | 587028 | 500 | 150 | 119 | 231 | +21.57 ± 15.43 | 0 49 122 78 1 | +41.89 ± 30.91 |
| 378931 | ncm-dbt-05 | 581569 | 500 | 141 | 116 | 243 | +17.39 ± 15.37 | 1 49 124 76 0 | +36.26 ± 30.66 |
| 378930 | ncm-dbt-01 | 586731 | 500 | 133 | 115 | 252 | +12.51 ± 15.36 | 1 52 125 72 0 | +26.46 ± 30.53 |
| 378929 | ncm-dbt-03 | 587962 | 500 | 134 | 114 | 252 | +13.9 ± 14.08 | 0 44 142 64 0 | +27.85 ± 28.34 |
| 378928 | ncm-dbt-04 | 570549 | 500 | 138 | 103 | 259 | +24.36 ± 15.62 | 1 47 118 84 0 | +50.38 ± 31.41 |
| 378927 | ncm-dbt-02 | 584117 | 500 | 136 | 117 | 247 | +13.21 ± 15.05 | 1 47 136 64 2 | +25.06 ± 29.14 |
| 378926 | ncm-dbt-05 | 584201 | 500 | 146 | 105 | 249 | +28.56 ± 14.54 | 2 33 137 78 0 | +60.36 ± 28.92 |
| 378925 | ncm-dbt-01 | 582694 | 500 | 157 | 94 | 249 | +44.01 ± 14.86 | 0 33 122 94 1 | +88.0 ± 30.9 |
| 378924 | ncm-dbt-03 | 585464 | 500 | 155 | 118 | 227 | +25.76 ± 14.98 | 0 43 128 78 1 | +50.38 ± 30.14 |
| 378923 | ncm-dbt-04 | 570348 | 500 | 149 | 103 | 248 | +32.06 ± 14.38 | 1 33 135 81 0 | +66.1 ± 29.17 |
| 378922 | ncm-dbt-02 | 583782 | 500 | 157 | 118 | 225 | +27.16 ± 15.08 | 1 41 126 82 0 | +56.07 ± 30.4 |
| 378921 | ncm-dbt-03 | 587367 | 500 | 173 | 108 | 219 | +45.42 ± 14.41 | 0 30 125 95 0 | +92.46 ± 30.47 |
| 378920 | ncm-dbt-05 | 582736 | 500 | 153 | 107 | 240 | +32.06 ± 14.12 | 1 31 139 79 0 | +66.1 ± 28.61 |
| 378919 | ncm-dbt-01 | 584706 | 500 | 160 | 100 | 240 | +41.89 ± 14.48 | 0 31 129 89 1 | +83.57 ± 29.93 |
| 378918 | ncm-dbt-04 | 570709 | 500 | 149 | 99 | 252 | +34.86 ± 14.57 | 0 36 128 86 0 | +70.44 ± 30.11 |
| 378917 | ncm-dbt-02 | 584243 | 500 | 141 | 99 | 260 | +29.25 ± 15.34 | 0 45 118 87 0 | +58.93 ± 31.42 |
| 378916 | ncm-dbt-01 | 586435 | 500 | 154 | 103 | 243 | +35.56 ± 14.35 | 1 30 137 81 1 | +71.89 ± 28.86 |
| 378915 | ncm-dbt-05 | 587409 | 500 | 148 | 100 | 252 | +33.46 ± 14.99 | 0 40 122 88 0 | +67.55 ± 30.91 |
| 378914 | ncm-dbt-03 | 587452 | 500 | 151 | 111 | 238 | +27.85 ± 13.69 | 0 33 144 73 0 | +56.07 ± 27.96 |
| 378913 | ncm-dbt-04 | 570148 | 500 | 152 | 103 | 245 | +34.16 ± 14.78 | 0 37 128 84 1 | +67.55 ± 30.12 |

Commit

Commit ID a08b8d4e9711c20acedbfe17d618c3c384b339ec
Author Joost VandeVondele
Date 2023-01-22 09:54:15 UTC
Update UCI_Elo parameterization

The old parameterization (https://github.com/official-stockfish/Stockfish/pull/2225/files) has now become quite inaccurate. This updates the formula based on updated results with master. The formula is based on a fit of the Elo results for games played between master at various skill levels, and various versions of the Stash engine, which have been ranked at CCRL.

```
   # PLAYER          :  RATING  ERROR  POINTS  PLAYED   (%)
   1 master-skill-19 :  3191.1   40.4   940.0    1707    55
   2 master-skill-18 :  3170.3   39.3  1343.0    2519    53
   3 master-skill-17 :  3141.3   37.8  2282.0    4422    52
   4 master-skill-16 :  3111.2   37.1  2773.0    5423    51
   5 master-skill-15 :  3069.5   37.2  2728.5    5386    51
   6 master-skill-14 :  3024.8   36.1  2702.0    5339    51
   7 master-skill-13 :  2972.9   35.4  2645.5    5263    50
   8 master-skill-12 :  2923.1   35.0  2653.5    5165    51
   9 master-skill-11 :  2855.5   33.6  2524.0    5081    50
  10 master-skill-10 :  2788.3   32.0  2724.5    5511    49
  11 stash-bot-v25.0 :  2744.0   31.5  1952.5    3840    51
  12 master-skill-9  :  2702.8   30.5  2670.0    5018    53
  13 master-skill-8  :  2596.2   28.5  2669.5    4975    54
  14 stash-bot-v21.0 :  2561.2   30.0  1338.0    3366    40
  15 master-skill-7  :  2499.5   28.5  1934.0    4178    46
  16 stash-bot-v20.0 :  2452.6   27.7  1606.5    3378    48
  17 stash-bot-v19.0 :  2425.3   26.7  1787.0    3365    53
  18 master-skill-6  :  2363.2   26.4  2510.5    4379    57
  19 stash-bot-v17.0 :  2280.7   25.4  2209.0    4378    50
  20 master-skill-5  :  2203.7   25.3  2859.5    5422    53
  21 stash-bot-v15.3 :  2200.0   25.4  1757.0    4383    40
  22 stash-bot-v14   :  2145.9   25.5  2890.0    5167    56
  23 stash-bot-v13   :  2042.7   25.8  2263.5    4363    52
  24 stash-bot-v12   :  1963.4   25.8  1769.5    4210    42
  25 master-skill-4  :  1922.9   25.9  2690.0    5399    50
  26 stash-bot-v11   :  1873.0   26.3  2203.5    4335    51
  27 stash-bot-v10   :  1783.8   27.8  2568.5    4301    60
  28 master-skill-3  :  1742.3   27.8  1909.5    4439    43
  29 master-skill-2  :  1608.4   29.4  2064.5    4389    47
  30 stash-bot-v9    :  1582.6   30.2  2130.0    4230    50
  31 master-skill-1  :  1467.6   31.3  2015.5    4244    47
  32 stash-bot-v8    :  1452.8   31.5  1953.5    3780    52
  33 master-skill-0  :  1320.1   32.9   651.5    2083    31
```

Skill 0 .. 19, now covers CCRL Blitz Elo from 1320 to 3190, approximately. Indeed, the Elo of stash in this analysis is only to within +- 100 Elo of CCRL, probably because it depends quite a bit on the opponent pool.

To obtain a skill level for a given Elo number, the above data is fit as a 3rd degree polynomial Skill(Elo). A quick test confirms the correspondence to the above table:

```
Score of master-elo-2721 vs stash-bot-v21.0: 51 - 16 - 19 [0.703] 86
Elo difference: 150.1 +/- 70.2, LOS: 100.0 %, DrawRatio: 22.1 %
```

closes https://github.com/official-stockfish/Stockfish/pull/4341

No functional change.
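
The commit message describes fitting the rating table as a third-degree polynomial Skill(Elo). The sketch below illustrates that idea only: it takes the master-skill ratings from the table, rescales Elo to roughly [0, 1] using the 1320 to 3190 range quoted in the message, and solves an ordinary least-squares cubic. The least-squares method, the rescaling, the clamp to 0..19, and every name in the code are assumptions for illustration, not the code or coefficients that went into Stockfish.

```python
# Ratings of master at skill levels 0..19, copied from the commit's table
# (list index = skill level).
SKILL_ELO = [
    1320.1, 1467.6, 1608.4, 1742.3, 1922.9, 2203.7, 2363.2, 2499.5,
    2596.2, 2702.8, 2788.3, 2855.5, 2923.1, 2972.9, 3024.8, 3069.5,
    3111.2, 3141.3, 3170.3, 3191.1,
]

# Assumed normalisation range, taken from "1320 to 3190, approximately".
ELO_MIN, ELO_MAX = 1320.0, 3190.0


def normalise(elo: float) -> float:
    """Map Elo to roughly [0, 1] so the normal equations stay well conditioned."""
    return (elo - ELO_MIN) / (ELO_MAX - ELO_MIN)


def fit_cubic(xs, ys):
    """Least-squares cubic fit; returns coefficients [c0, c1, c2, c3]."""
    n = 4
    # Augmented normal equations: sum(x^(i+j)) * c_j = sum(y * x^i).
    a = [
        [sum(x ** (i + j) for x in xs) for j in range(n)]
        + [sum(y * x ** i for x, y in zip(xs, ys))]
        for i in range(n)
    ]
    # Gaussian elimination with partial pivoting.
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for c in range(col, n + 1):
                a[r][c] -= factor * a[col][c]
    # Back substitution.
    coeffs = [0.0] * n
    for i in reversed(range(n)):
        rest = sum(a[i][j] * coeffs[j] for j in range(i + 1, n))
        coeffs[i] = (a[i][n] - rest) / a[i][i]
    return coeffs


def skill_for_elo(elo: float, coeffs) -> float:
    """Evaluate the fitted cubic and clamp to the 0..19 skill range (assumed)."""
    e = normalise(elo)
    raw = sum(c * e ** k for k, c in enumerate(coeffs))
    return min(19.0, max(0.0, raw))


COEFFS = fit_cubic([normalise(elo) for elo in SKILL_ELO],
                   [float(skill) for skill in range(len(SKILL_ELO))])

# The quick test in the commit message used an engine limited to 2721 Elo.
print(f"Fractional skill for Elo 2721: {skill_for_elo(2721, COEFFS):.2f}")
```

Elo 2721 lies between the table's ratings for skill 9 (2702.8) and skill 10 (2788.3), so a reasonable fit should return a fractional skill in that neighbourhood.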
Copyright 2011–2024 Next Chess Move LLC