NCM plays 20,000 games between each Stockfish dev build and Stockfish 15. The results yield an approximate Elo difference and establish confidence in the strength of the dev builds.
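
As a rough illustration of how a win/loss/draw record maps to an Elo difference, the sketch below uses the standard logistic Elo formula with a normal-approximation error margin. This is an assumption made for illustration: the page does not state how NCM computes its "Standard Elo" figure. The function names and the sample record are hypothetical.

```python
import math


def elo_from_score(score):
    """Convert a mean score in (0, 1) to a logistic Elo difference."""
    return -400.0 * math.log10(1.0 / score - 1.0)


def elo_from_wld(wins, losses, draws):
    """Return (elo, margin) for a W/L/D record.

    The margin is half the width of an approximate 95% interval,
    assuming independent games (a simplification: games played in
    pairs from the same opening are correlated).
    """
    games = wins + losses + draws
    score = (wins + 0.5 * draws) / games
    # Per-game variance of the score over the three possible outcomes.
    variance = (
        wins * (1.0 - score) ** 2
        + draws * (0.5 - score) ** 2
        + losses * (0.0 - score) ** 2
    ) / games
    stderr = math.sqrt(variance / games)
    low = elo_from_score(score - 1.96 * stderr)
    high = elo_from_score(score + 1.96 * stderr)
    return elo_from_score(score), (high - low) / 2.0


# Hypothetical 20,000-game record, not taken from NCM's data.
elo, margin = elo_from_wld(wins=6000, losses=4000, draws=10000)
print(f"Elo difference: {elo:+.2f} +/- {margin:.2f}")
```

With 20,000 games the margin shrinks to a few Elo points, which is why a large fixed game count gives a usable strength estimate even for small differences between builds.
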
| Host | Duration | Avg Base NPS | Games | WLD | Standard Elo | Ptnml(0-2) | Gamepair Elo |
|---|---|---|---|---|---|---|---|
| ID | Host | Base NPS | Games | WLD | Standard Elo | Ptnml(0-2) | Gamepair Elo | CLI | PGN |
|---|---|---|---|---|---|---|---|---|---|
| Commit ID | d2acdac1012cfbd8563fbe70157b2adbb05b2002 |
|---|---|
| Author | Joost VandeVondele |
| Date | 2019-01-09 15:57:24 UTC |

Small improvements to the CI infrastructure

- avoid inlining for the debug testing so that suppressions work
- provide more output for triggered errors

No functional change.