Dev Builds » 20240126-1955

Use this dev build

NCM plays each Stockfish dev build 20,000 times against Stockfish 15. This yields an approximate Elo difference and establishes confidence in the strength of the dev builds.

Summary

Host Duration Avg Base NPS Games WLD Standard Elo Ptnml(0-2) Gamepair Elo

Test Detail

ID Host Base NPS Games WLD Standard Elo Ptnml(0-2) Gamepair Elo CLI PGN

Commit

Commit ID 37bd1e774ee4eb03e558062284da1e72cbce5a95
Author Michael Chaly
Date 2024-01-26 19:55:16 UTC
Do more double extensions Parameter tweak from Black Marlin chess engine. Choose a significantly lower value that triggers in 95% of cases, compared to the usual 84% in standard benchmark runs. Since the introduction by https://github.com/official-stockfish/Stockfish/commit/33a858eaa1f792b3413384a3d0993dba36aca92e this constant has only decreased in value over time. 2-16-17-18-21-22-25-26-52-71-75-93-140 Failed STC really fast: https://tests.stockfishchess.org/tests/view/65b11d05c865510db026df7b LLR: -2.94 (-2.94,2.94) <0.00,2.00> Total: 13216 W: 3242 L: 3485 D: 6489 Ptnml(0-2): 50, 1682, 3371, 1471, 34 Was reasonable at LTC: https://tests.stockfishchess.org/tests/view/65b13e20c865510db026e210 Elo: 1.18 ± 1.5 (95%) LOS: 94.3% Total: 50000 W: 12517 L: 12347 D: 25136 Ptnml(0-2): 31, 5598, 13579, 5754, 38 nElo: 2.45 ± 3.0 (95%) PairsRatio: 1.03 Passed VLTC with STC bounds: https://tests.stockfishchess.org/tests/view/65b18870c865510db026e769 LLR: 2.95 (-2.94,2.94) <0.00,2.00> Total: 30456 W: 7726 L: 7448 D: 15282 Ptnml(0-2): 6, 3111, 8717, 3387, 7 Passed VVLTC with LTC bounds: https://tests.stockfishchess.org/tests/view/65b20b95c865510db026eef0 LLR: 2.95 (-2.94,2.94) <0.50,2.50> Total: 36134 W: 9158 L: 8859 D: 18117 Ptnml(0-2): 3, 3455, 10850, 3758, 1 closes https://github.com/official-stockfish/Stockfish/pull/5013 Bench: 1503692
Copyright 2011–2024 Next Chess Move LLC