NCM plays each Stockfish dev build 20,000 times against Stockfish 14. This yields an approximate Elo difference and establishes confidence in the strength of the dev builds.
| Host | Duration | Avg Base NPS | Games | WLD | Standard Elo | Ptnml(0-2) | Gamepair Elo | 
|---|
| ID | Host | Base NPS | Games | WLD | Standard Elo | Ptnml(0-2) | Gamepair Elo | CLI | PGN | 
|---|
| Commit ID | f907d5b7d93a161bc88aeaec403631de9de092f9 | 
|---|---|
| Author | Brian Sheppard | 
| Date | 2017-06-17 02:30:19 UTC | 
| Move depth calculation in probCut
The change passed an STC regression:
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 59350 W: 10793 L: 10738 D: 37819
I verified that there was no change in performance on my machine, but of course YMMV:
Results for 40 tests for each version:
                Base      Test      Diff
        Mean    2014338   2016121   -1783
        StDev   62655     63441     3860
p-value: 0.678
speedup: 0.001
No functional change.
Closes #1137 | |