NCM plays each Stockfish dev build 20,000 times against Stockfish 15. This yields an approximate Elo difference and establishes confidence in the strength of the dev builds.
Host | Duration | Avg Base NPS | Games | WLD | Standard Elo | Ptnml(0-2) | Gamepair Elo |
---|---|---|---|---|---|---|---|
ncm-dbt-01 | 00:07:28 | 582277 | 64 | 17 5 42 | +65.92 ± 37.57 | 0 2 16 14 0 | +136.97 ± 86.45 |
ncm-dbt-02 | 00:06:42 | 585126 | 62 | 16 15 31 | +5.6 ± 39.74 | 0 6 18 7 0 | +11.21 ± 80.61 |
ncm-dbt-03 | 00:05:51 | 581652 | 48 | 14 13 21 | +7.24 ± 42.71 | 0 4 15 5 0 | +14.48 ± 86.84 |
ncm-dbt-04 | 00:07:29 | 568077 | 66 | 21 19 26 | +10.53 ± 35.73 | 0 5 21 7 0 | +21.08 ± 72.43 |
ncm-dbt-05 | 00:07:34 | 578300 | 62 | 20 14 28 | +33.73 ± 36.65 | 0 3 19 9 0 | +68.1 ± 76.48 |
302 | 88 66 148 | +25.35 ± 17.4 | 0 20 89 42 0 | +50.98 ± 35.46 |
ID | Host | Base NPS | Games | WLD | Standard Elo | Ptnml(0-2) | Gamepair Elo | CLI | PGN | ||
---|---|---|---|---|---|---|---|---|---|---|---|
431391 | ncm-dbt-03 | 581652 | 48 | 14 13 21 | +7.24 ± 42.71 | 0 4 15 5 0 | +14.48 ± 86.84 | ||||
cutechess-cli \ -rounds 266 \ -games 2 \ -concurrency 16 \ -srand 3433214551 \ -pgnout ncm-dbt-20221123-2036-005.pgn \ -openings \ file=UHO_4060_v2.epd \ format=epd \ order=random \ -repeat \ -resign \ movecount=3 \ score=600 \ -draw \ movenumber=34 \ movecount=8 \ score=5 \ -each \ tc=30+0.3 \ timemargin=10000 \ proto=uci \ option.Hash=128 \ option.Threads=8 \ -engine \ name=20221123-2036 \ cmd=docker \ arg=run \ arg=-i \ arg=--rm \ arg=--entrypoint=/engine \ arg=dev_build:1370127fcd72b5c6646ff03a4a779b81ad0bcf3d \ -engine \ name=sf15 \ cmd=docker \ arg=run \ arg=-i \ arg=--rm \ arg=--entrypoint=/engine \ arg=stockfish:15 |
|||||||||||
431390 | ncm-dbt-02 | 585126 | 62 | 16 15 31 | +5.6 ± 39.74 | 0 6 18 7 0 | +11.21 ± 80.61 | ||||
cutechess-cli \ -rounds 266 \ -games 2 \ -concurrency 16 \ -srand 2923865583 \ -pgnout ncm-dbt-20221123-2036-004.pgn \ -openings \ file=UHO_4060_v2.epd \ format=epd \ order=random \ -repeat \ -resign \ movecount=3 \ score=600 \ -draw \ movenumber=34 \ movecount=8 \ score=5 \ -each \ tc=30+0.3 \ timemargin=10000 \ proto=uci \ option.Hash=128 \ option.Threads=8 \ -engine \ name=20221123-2036 \ cmd=docker \ arg=run \ arg=-i \ arg=--rm \ arg=--entrypoint=/engine \ arg=dev_build:1370127fcd72b5c6646ff03a4a779b81ad0bcf3d \ -engine \ name=sf15 \ cmd=docker \ arg=run \ arg=-i \ arg=--rm \ arg=--entrypoint=/engine \ arg=stockfish:15 |
|||||||||||
431389 | ncm-dbt-01 | 582277 | 64 | 17 5 42 | +65.92 ± 37.57 | 0 2 16 14 0 | +136.97 ± 86.45 | ||||
cutechess-cli \ -rounds 266 \ -games 2 \ -concurrency 16 \ -srand 1293476982 \ -pgnout ncm-dbt-20221123-2036-003.pgn \ -openings \ file=UHO_4060_v2.epd \ format=epd \ order=random \ -repeat \ -resign \ movecount=3 \ score=600 \ -draw \ movenumber=34 \ movecount=8 \ score=5 \ -each \ tc=30+0.3 \ timemargin=10000 \ proto=uci \ option.Hash=128 \ option.Threads=8 \ -engine \ name=20221123-2036 \ cmd=docker \ arg=run \ arg=-i \ arg=--rm \ arg=--entrypoint=/engine \ arg=dev_build:1370127fcd72b5c6646ff03a4a779b81ad0bcf3d \ -engine \ name=sf15 \ cmd=docker \ arg=run \ arg=-i \ arg=--rm \ arg=--entrypoint=/engine \ arg=stockfish:15 |
|||||||||||
431388 | ncm-dbt-04 | 568077 | 66 | 21 19 26 | +10.53 ± 35.73 | 0 5 21 7 0 | +21.08 ± 72.43 | ||||
cutechess-cli \ -rounds 266 \ -games 2 \ -concurrency 16 \ -srand 4202539095 \ -pgnout ncm-dbt-20221123-2036-002.pgn \ -openings \ file=UHO_4060_v2.epd \ format=epd \ order=random \ -repeat \ -resign \ movecount=3 \ score=600 \ -draw \ movenumber=34 \ movecount=8 \ score=5 \ -each \ tc=30+0.3 \ timemargin=10000 \ proto=uci \ option.Hash=128 \ option.Threads=8 \ -engine \ name=20221123-2036 \ cmd=docker \ arg=run \ arg=-i \ arg=--rm \ arg=--entrypoint=/engine \ arg=dev_build:1370127fcd72b5c6646ff03a4a779b81ad0bcf3d \ -engine \ name=sf15 \ cmd=docker \ arg=run \ arg=-i \ arg=--rm \ arg=--entrypoint=/engine \ arg=stockfish:15 |
|||||||||||
431387 | ncm-dbt-05 | 578300 | 62 | 20 14 28 | +33.73 ± 36.65 | 0 3 19 9 0 | +68.1 ± 76.48 | ||||
cutechess-cli \ -rounds 266 \ -games 2 \ -concurrency 16 \ -srand 2387284485 \ -pgnout ncm-dbt-20221123-2036-001.pgn \ -openings \ file=UHO_4060_v2.epd \ format=epd \ order=random \ -repeat \ -resign \ movecount=3 \ score=600 \ -draw \ movenumber=34 \ movecount=8 \ score=5 \ -each \ tc=30+0.3 \ timemargin=10000 \ proto=uci \ option.Hash=128 \ option.Threads=8 \ -engine \ name=20221123-2036 \ cmd=docker \ arg=run \ arg=-i \ arg=--rm \ arg=--entrypoint=/engine \ arg=dev_build:1370127fcd72b5c6646ff03a4a779b81ad0bcf3d \ -engine \ name=sf15 \ cmd=docker \ arg=run \ arg=-i \ arg=--rm \ arg=--entrypoint=/engine \ arg=stockfish:15 |
Commit ID | 1370127fcd72b5c6646ff03a4a779b81ad0bcf3d |
---|---|
Author | peregrineshahin |
Date | 2022-11-23 20:36:22 UTC |
Simplify both quiet check evasions' conditions
passed Non-regression STC:
https://tests.stockfishchess.org/tests/view/6370b647f1b748d4819e0b64
LLR: 2.95 (-2.94,2.94) <-1.75,0.25>
Total: 162904 W: 43249 L: 43171 D: 76484
Ptnml(0-2): 491, 17089, 46220, 17155, 497
closes https://github.com/official-stockfish/Stockfish/pull/4228
No functional change
|