NCM plays each Stockfish dev build 20,000 times against Stockfish 15. This yields an approximate Elo difference and establishes confidence in the strength of the dev builds.
Host | Duration | Avg Base NPS | Games | WLD | Standard Elo | Ptnml(0-2) | Gamepair Elo |
---|
ID | Host | Base NPS | Games | WLD | Standard Elo | Ptnml(0-2) | Gamepair Elo | CLI | PGN |
---|
Commit ID | 95ce443aaacadea777f34d87b0abf984e724f0dd |
---|---|
Author | rn5f107s2 |
Date | 2023-07-03 16:54:22 UTC |
simplified gives check castling
tested verifying perft and bench is unchanged
on a larger set of epds for both standard and FRC chess.
Passed non-regression STC:
https://tests.stockfishchess.org/tests/live_elo/648587be65ffe077ca123d78
LLR: 2.95 (-2.94,2.94) <-1.75,0.25>
Total: 153632 W: 41015 L: 40928 D: 71689
Ptnml(0-2): 377, 16077, 43816, 16174, 372
closes https://github.com/official-stockfish/Stockfish/pull/4628
No functional change
|