NCM plays each Stockfish dev build 20,000 times against Stockfish 15. This yields an approximate Elo difference and establishes confidence in the strength of the dev builds.
Host | Duration | Avg Base NPS | Games | WLD | Standard Elo | Ptnml(0-2) | Gamepair Elo |
---|
ID | Host | Base NPS | Games | WLD | Standard Elo | Ptnml(0-2) | Gamepair Elo | CLI | PGN |
---|
Commit ID | 29295ecfd357b5421eb4f273761dce8d9661f130 |
---|---|
Author | mstembera |
Date | 2022-09-27 05:44:46 UTC |
Simplify EVASIONS scoring
remove some multipliers & adjust, doesn't change the move ordering
STC https://tests.stockfishchess.org/tests/view/6325c1c9b9c0caa5f4a759ae
LLR: 2.94 (-2.94,2.94) <-1.75,0.25>
Total: 192760 W: 51528 L: 51482 D: 89750
Ptnml(0-2): 642, 20490, 54148, 20380, 720
Credit to locutus2
closes https://github.com/official-stockfish/Stockfish/pull/4171
No functional change
|