NCM plays each Stockfish dev build 20,000 times against Stockfish 15. This yields an approximate Elo difference and establishes confidence in the strength of the dev builds.
Commit ID | 8c10029df135f841126d17a65002e217c7ca1887 |
---|---|
Author | Marco Costalba |
Date | 2013-04-05 06:59:38 UTC |
Revert "Double Impact of Gain tables"
This reverts commit 36c82b751ce227c05bfb
Seems a regression against 2.3.1 tested with 20K games at 60"+0.05
With patch applied
ELO: 15.44 +-2.8 (95%) LOS: 100.0%
Total: 20000 W: 3928 L: 3040 D: 13032
Without patch applied
ELO: 18.76 +-2.8 (95%) LOS: 100.0%
Total: 20000 W: 3903 L: 2824 D: 13273
bench: 4781239
|
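The ELO, error-margin, and LOS figures in the commit message can be checked against the W/L/D totals. The sketch below is a minimal illustration, assuming the standard logistic Elo formula, a normal approximation for the 95% interval, and the usual likelihood-of-superiority formula based on decisive games only. The source does not say which tool produced the original figures, and the function names `elo_from_score` and `match_stats` are hypothetical.

```python
import math


def elo_from_score(score: float) -> float:
    """Logistic Elo difference implied by a mean score in (0, 1)."""
    return -400.0 * math.log10(1.0 / score - 1.0)


def match_stats(wins: int, losses: int, draws: int, z: float = 1.96):
    """Return (elo, margin, los) for a W/L/D record.

    z = 1.96 gives a two-sided 95% interval under a normal approximation
    (an assumption; the original tool's method is not stated in the source).
    """
    games = wins + losses + draws
    score = (wins + 0.5 * draws) / games

    # Per-game variance of the result (1, 0.5 or 0) around the mean score.
    variance = (
        wins * (1.0 - score) ** 2
        + draws * (0.5 - score) ** 2
        + losses * (0.0 - score) ** 2
    ) / games
    std_err = math.sqrt(variance / games)

    elo = elo_from_score(score)
    # Half-width of the score interval, mapped through the Elo curve.
    margin = (
        elo_from_score(score + z * std_err)
        - elo_from_score(score - z * std_err)
    ) / 2.0

    # Likelihood of superiority: draws carry no information here.
    los = 0.5 * (
        1.0 + math.erf((wins - losses) / math.sqrt(2.0 * (wins + losses)))
    )
    return elo, margin, los


# W/L/D totals taken from the commit message above.
results = {
    "With patch applied": (3928, 3040, 13032),
    "Without patch applied": (3903, 2824, 13273),
}

for label, (w, l, d) in results.items():
    elo, margin, los = match_stats(w, l, d)
    print(f"{label}: ELO {elo:.2f} +-{margin:.1f} (95%) LOS {los:.1%}")
```

Under these assumptions the computed values should agree with the quoted 15.44 and 18.76 Elo to the printed precision. The two intervals overlap (each is about ±2.8 Elo), which is consistent with the commit message's cautious wording, "Seems a regression".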