Dev Builds » 20181014-1833

You are viewing an old NCM Stockfish dev build test. You may find the most recent dev build tests using Stockfish 15 as the baseline here.

Use this dev build

NCM plays each Stockfish dev build 20,000 times against Stockfish 14. This yields an approximate Elo difference and establishes confidence in the strength of the dev builds.

Summary

Host Duration Avg Base NPS Games WLD Standard Elo Ptnml(0-2) Gamepair Elo

Test Detail

ID Host Base NPS Games WLD Standard Elo Ptnml(0-2) Gamepair Elo CLI PGN

Commit

Commit ID 97d2cc9a9c1c4b6ff1b470676fa18c7fc6509886
Author Joost VandeVondele
Date 2018-10-14 18:33:52 UTC
Randomize draw eval The patch adds a small random component (+-1) to VALUE_DRAW for the evaluation of draw positions (mostly 3folds). This random component is not static, but potentially different for each visit of the node (hence derived from the node counter). The effect is that in positions with many 3fold draw lines, different lines are followed at each iteration. This keeps the search much more dynamic, as opposed to being locked to one particular 3fold. An example of a position where master suffers from 3fold-blindness and this patch solves quickly is the famous TCEC game 53: FEN: 3r2k1/pr6/1p3q1p/5R2/3P3p/8/5RP1/3Q2K1 b - - 0 51 master doesn't see that this is a lost position (draw eval up to depth 50) as Qf6-e6 d4-d5 (found by patch at depth 23) leads to a loss. The 3fold-blindness is more important at longer TC, the patch was yellow STC and LTC, but passed VLTC: STC LLR: -2.95 (-2.94,2.94) [0.00,5.00] Total: 46328 W: 10048 L: 9953 D: 26327 http://tests.stockfishchess.org/tests/view/5b9c0ca20ebc592cf275f7c7 LTC LLR: -2.95 (-2.94,2.94) [0.00,5.00] Total: 54663 W: 8938 L: 8846 D: 36879 http://tests.stockfishchess.org/tests/view/5b9ca1610ebc592cf27601d3 VLTC LLR: 2.95 (-2.94,2.94) [0.00,5.00] Total: 31789 W: 4512 L: 4284 D: 22993 http://tests.stockfishchess.org/tests/view/5b9d1a670ebc592cf276076d Credit to @crossbr for pointing to this problem repeatedly, and giving the hint that many draw lines are typical in those situations. Bench: 4756639
Copyright 2011–2024 Next Chess Move LLC