Dev Builds » 20230122-0954

You are viewing an old NCM Stockfish dev build test. You may find the most recent dev build tests using Stockfish 15 as the baseline here.

Use this dev build

NCM plays each Stockfish dev build 20,000 times against Stockfish 7. This yields an approximate Elo difference and establishes confidence in the strength of the dev builds.

Summary

Host Duration Avg Base NPS Games Wins Losses Draws Elo
ncm-et-3 11:13:49 1953929 3331 2856 7 468 +443.18 ± 15.76
ncm-et-4 11:13:39 1950632 3298 2837 4 457 +448.03 ± 15.94
ncm-et-9 11:13:30 1956307 3361 2908 4 449 +454.8 ± 16.08
ncm-et-10 11:13:33 1951598 3324 2843 2 479 +442.39 ± 15.55
ncm-et-13 11:13:41 1960552 3353 2877 3 473 +445.58 ± 15.66
ncm-et-15 11:13:19 1960574 3333 2869 5 459 +448.4 ± 15.91
20000 17190 25 2785 +447.03 ± 6.44

Test Detail

ID Host Started (UTC) Duration Base NPS Games Wins Losses Draws Elo CLI PGN
164292 ncm-et-4 2023-01-25 07:09 01:00:31 1959102 298 258 0 40 +457.2 ± 55.16
164291 ncm-et-10 2023-01-25 07:03 01:06:42 1948576 324 274 1 49 +427.36 ± 49.63
164290 ncm-et-15 2023-01-25 07:03 01:07:10 1965435 333 286 0 47 +447.83 ± 50.65
164289 ncm-et-3 2023-01-25 07:03 01:07:15 1935008 331 290 0 41 +472.11 ± 54.49
164288 ncm-et-13 2023-01-25 06:59 01:11:28 1960781 353 303 0 50 +447.17 ± 49.03
164287 ncm-et-9 2023-01-25 06:58 01:12:26 1963626 361 309 1 51 +440.46 ± 48.63
164286 ncm-et-4 2023-01-25 05:27 01:41:15 1958960 500 419 1 80 +419.61 ± 38.47
164285 ncm-et-10 2023-01-25 05:20 01:42:19 1958658 500 428 0 72 +444.08 ± 40.59
164284 ncm-et-15 2023-01-25 05:20 01:42:07 1953454 500 419 0 81 +421.93 ± 38.15
164283 ncm-et-3 2023-01-25 05:20 01:42:36 1957424 500 426 0 74 +438.95 ± 40.01
164282 ncm-et-13 2023-01-25 05:19 01:38:59 1967415 500 433 1 66 +454.76 ± 42.55
164281 ncm-et-9 2023-01-25 05:18 01:39:24 1962187 500 427 1 72 +438.95 ± 40.66
164280 ncm-et-4 2023-01-25 03:44 01:42:43 1954045 500 428 0 72 +444.08 ± 40.59
164279 ncm-et-10 2023-01-25 03:38 01:41:19 1948881 500 429 0 71 +446.7 ± 40.89
164278 ncm-et-13 2023-01-25 03:38 01:39:54 1966465 500 425 0 75 +436.43 ± 39.73
164277 ncm-et-15 2023-01-25 03:38 01:41:45 1959008 500 428 2 70 +438.95 ± 41.29
164276 ncm-et-3 2023-01-25 03:37 01:41:37 1957248 500 428 0 72 +444.08 ± 40.59
164275 ncm-et-9 2023-01-25 03:34 01:42:40 1935504 500 432 2 66 +449.35 ± 42.58
164274 ncm-et-4 2023-01-25 02:00 01:43:16 1950614 500 437 0 63 +468.95 ± 43.54
164273 ncm-et-15 2023-01-25 01:57 01:40:18 1963534 500 425 0 75 +436.43 ± 39.73
164272 ncm-et-3 2023-01-25 01:56 01:40:46 1959118 500 429 0 71 +446.7 ± 40.89
164271 ncm-et-13 2023-01-25 01:56 01:42:01 1948719 500 431 1 68 +449.35 ± 41.89
164270 ncm-et-10 2023-01-25 01:56 01:42:09 1943454 500 424 1 75 +431.48 ± 39.8
164269 ncm-et-9 2023-01-25 01:55 01:38:45 1964623 500 443 0 57 +487.45 ± 45.89
164268 ncm-et-3 2023-01-25 00:16 01:38:39 1961012 500 427 4 69 +431.48 ± 41.59
164267 ncm-et-4 2023-01-25 00:16 01:43:03 1945902 500 434 0 66 +460.32 ± 42.48
164266 ncm-et-10 2023-01-25 00:15 01:39:43 1960344 500 431 0 69 +452.04 ± 41.5
164265 ncm-et-13 2023-01-25 00:15 01:40:20 1961463 500 430 0 70 +449.35 ± 41.19
164264 ncm-et-15 2023-01-25 00:15 01:41:25 1951184 500 442 2 56 +477.99 ± 46.39
164263 ncm-et-9 2023-01-25 00:13 01:41:11 1945225 500 426 0 74 +438.95 ± 40.01
164262 ncm-et-13 2023-01-24 22:35 01:39:07 1962805 500 434 1 65 +457.52 ± 42.89
164261 ncm-et-10 2023-01-24 22:34 01:40:25 1954965 500 435 0 65 +463.15 ± 42.83
164260 ncm-et-15 2023-01-24 22:34 01:40:06 1965240 500 433 1 66 +454.76 ± 42.55
164259 ncm-et-4 2023-01-24 22:34 01:42:19 1937488 500 435 1 64 +460.32 ± 43.24
164258 ncm-et-3 2023-01-24 22:33 01:42:28 1951786 500 438 1 61 +468.96 ± 44.35
164257 ncm-et-9 2023-01-24 22:32 01:40:13 1959889 500 435 0 65 +463.15 ± 42.83
164256 ncm-et-15 2023-01-24 20:53 01:40:28 1966166 500 436 0 64 +466.03 ± 43.18
164255 ncm-et-10 2023-01-24 20:53 01:40:56 1946310 500 422 0 78 +429.05 ± 38.92
164254 ncm-et-9 2023-01-24 20:53 01:38:51 1963097 500 436 0 64 +466.03 ± 43.18
164253 ncm-et-13 2023-01-24 20:52 01:41:52 1956220 500 421 0 79 +426.65 ± 38.66
164252 ncm-et-4 2023-01-24 20:52 01:40:32 1948316 500 426 2 72 +433.94 ± 40.69
164251 ncm-et-3 2023-01-24 20:52 01:40:28 1955909 500 418 2 80 +415.05 ± 38.52

Commit

Commit ID a08b8d4e9711c20acedbfe17d618c3c384b339ec
Author Joost VandeVondele
Date 2023-01-22 09:54:15 UTC
Update UCI_Elo parameterization The old parameterization (https://github.com/official-stockfish/Stockfish/pull/2225/files) has now become quite inaccurate. This updates the formula based on updated results with master. The formula is based on a fit of the Elo results for games played between master at various skill levels, and various versions of the Stash engine, which have been ranked at CCRL. ``` # PLAYER : RATING ERROR POINTS PLAYED (%) 1 master-skill-19 : 3191.1 40.4 940.0 1707 55 2 master-skill-18 : 3170.3 39.3 1343.0 2519 53 3 master-skill-17 : 3141.3 37.8 2282.0 4422 52 4 master-skill-16 : 3111.2 37.1 2773.0 5423 51 5 master-skill-15 : 3069.5 37.2 2728.5 5386 51 6 master-skill-14 : 3024.8 36.1 2702.0 5339 51 7 master-skill-13 : 2972.9 35.4 2645.5 5263 50 8 master-skill-12 : 2923.1 35.0 2653.5 5165 51 9 master-skill-11 : 2855.5 33.6 2524.0 5081 50 10 master-skill-10 : 2788.3 32.0 2724.5 5511 49 11 stash-bot-v25.0 : 2744.0 31.5 1952.5 3840 51 12 master-skill-9 : 2702.8 30.5 2670.0 5018 53 13 master-skill-8 : 2596.2 28.5 2669.5 4975 54 14 stash-bot-v21.0 : 2561.2 30.0 1338.0 3366 40 15 master-skill-7 : 2499.5 28.5 1934.0 4178 46 16 stash-bot-v20.0 : 2452.6 27.7 1606.5 3378 48 17 stash-bot-v19.0 : 2425.3 26.7 1787.0 3365 53 18 master-skill-6 : 2363.2 26.4 2510.5 4379 57 19 stash-bot-v17.0 : 2280.7 25.4 2209.0 4378 50 20 master-skill-5 : 2203.7 25.3 2859.5 5422 53 21 stash-bot-v15.3 : 2200.0 25.4 1757.0 4383 40 22 stash-bot-v14 : 2145.9 25.5 2890.0 5167 56 23 stash-bot-v13 : 2042.7 25.8 2263.5 4363 52 24 stash-bot-v12 : 1963.4 25.8 1769.5 4210 42 25 master-skill-4 : 1922.9 25.9 2690.0 5399 50 26 stash-bot-v11 : 1873.0 26.3 2203.5 4335 51 27 stash-bot-v10 : 1783.8 27.8 2568.5 4301 60 28 master-skill-3 : 1742.3 27.8 1909.5 4439 43 29 master-skill-2 : 1608.4 29.4 2064.5 4389 47 30 stash-bot-v9 : 1582.6 30.2 2130.0 4230 50 31 master-skill-1 : 1467.6 31.3 2015.5 4244 47 32 stash-bot-v8 : 1452.8 31.5 1953.5 3780 52 33 master-skill-0 : 1320.1 32.9 651.5 2083 31 ``` Skill 0 .. 19, now covers CCRL Blitz Elo from 1320 to 3190, approximately. Indeed, the Elo of stash in this analysis is only to within +- 100 Elo of CCRL, probably because it depends quite a bit on the opponent pool. To obtain a skill level for a given Elo number, the above data is fit as a 3rd degree polynomial Skill(Elo). A quick test confirms the correspondence to the above table: ``` Score of master-elo-2721 vs stash-bot-v21.0: 51 - 16 - 19 [0.703] 86 Elo difference: 150.1 +/- 70.2, LOS: 100.0 %, DrawRatio: 22.1 % ``` closes https://github.com/official-stockfish/Stockfish/pull/4341 No functional change.
Copyright 2011–2024 Next Chess Move LLC