NCM plays each Stockfish dev build 20,000 times against Stockfish 14. This yields an approximate Elo difference and establishes confidence in the strength of the dev builds.
| Host | Duration | Avg Base NPS | Games | WLD | Standard Elo | Ptnml(0-2) | Gamepair Elo | 
|---|
| ID | Host | Base NPS | Games | WLD | Standard Elo | Ptnml(0-2) | Gamepair Elo | CLI | PGN | 
|---|
| Commit ID | 1ceaea701baaa79f378b0842ff0fb5d2a1f53ef7 | 
|---|---|
| Author | Joost VandeVondele | 
| Date | 2016-12-22 15:02:32 UTC | 
| Simplify threshold handling for probcut. (#936)
Just use greater equal as this is what see_ge does now.
passed STC
LLR: 2.94 (-2.94,2.94) [-3.00,1.00]
Total: 226506 W: 39755 L: 39978 D: 146773
passed LTC
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 138483 W: 17450 L: 17479 D: 103554
Bench: 5212921 | |