NCM plays each Stockfish dev build 20,000 times against Stockfish 15. This yields an approximate Elo difference and establishes confidence in the strength of the dev builds.
Host | Duration | Avg Base NPS | Games | WLD | Standard Elo | Ptnml(0-2) | Gamepair Elo |
---|
ID | Host | Base NPS | Games | WLD | Standard Elo | Ptnml(0-2) | Gamepair Elo | CLI | PGN |
---|
Commit ID | bc38efd1288c8cb25179935998cfa3dc6c9f4410 |
---|---|
Author | Marco Costalba |
Date | 2013-02-27 07:07:26 UTC |
Remove pruning condition on alpha
Further simplifying on Lucas's idea, seems reliable
in tests:
ELO: 2.15 +-7 (95%) LOS: 84.9%
Total: 9999 W: 1831 L: 1769 D: 6399
|