NCM plays each Stockfish dev build 20,000 times against Stockfish 15. This yields an approximate Elo difference and establishes confidence in the strength of the dev builds.
Commit ID | 8cd5cbf6939d76b33a744f1379a6f84a4ac3a6cb |
---|---|
Author | ttruscott |
Date | 2023-09-03 06:07:59 UTC |
Omit two unneeded tests
These redundant tests were intended as a speed-up, but they do not seem
to provide any speed anymore.
STC: https://tests.stockfishchess.org/tests/view/64e9079c85e3e95030fd8259
LLR: 2.96 (-2.94,2.94) <-1.75,0.25>
Total: 134688 W: 34338 L: 34226 D: 66124
Ptnml(0-2): 426, 15122, 36124, 15258, 414
closes https://github.com/official-stockfish/Stockfish/pull/4767
No functional change
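The Elo columns reported on this page can be approximated from raw results. The sketch below is a minimal illustration, not NCM's or fishtest's actual code: it applies the standard logistic Elo formula to the W/L/D and Ptnml(0-2) figures quoted in the commit message above. The function names are hypothetical. The "gamepair" definition used here is an assumption: a pair scoring more than 1 point counts as a win, less than 1 as a loss, and exactly 1 as a draw.

```python
import math


def elo_from_score(score: float) -> float:
    """Logistic Elo difference implied by an average score in (0, 1)."""
    return -400.0 * math.log10(1.0 / score - 1.0)


def standard_elo(wins: int, losses: int, draws: int) -> float:
    """Elo estimate from per-game results (a draw counts as half a point)."""
    games = wins + losses + draws
    return elo_from_score((wins + 0.5 * draws) / games)


def gamepair_elo(ptnml: list[int]) -> float:
    """Elo estimate from pentanomial counts, treating each pair as one result.

    ptnml[i] is the number of game pairs that scored i/2 points (0 to 2).
    Assumption: pairs below 1 point are losses, above 1 point are wins.
    """
    pair_losses = ptnml[0] + ptnml[1]
    pair_draws = ptnml[2]
    pair_wins = ptnml[3] + ptnml[4]
    pairs = pair_losses + pair_draws + pair_wins
    return elo_from_score((pair_wins + 0.5 * pair_draws) / pairs)


# Figures from the STC test quoted in the commit message.
W, L, D = 34338, 34226, 66124
PTNML = [426, 15122, 36124, 15258, 414]

# Each pentanomial entry is a pair of games, so the totals must agree.
assert 2 * sum(PTNML) == W + L + D

print(f"Standard Elo: {standard_elo(W, L, D):+.2f}")
print(f"Gamepair Elo: {gamepair_elo(PTNML):+.2f}")
```

Both estimates come out well under 1 Elo for these figures, which is consistent with a simplification patch tested against the non-regression bounds <-1.75,0.25>. No error margins are computed here.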