Distilling Stockfish with One Billion Positions
TLDR: I extracted FENs and Stockfish evaluations for 3.9 billion chess positions, then trained a neural network on 1 billion of them. To my knowledge, this is the largest open state-value chess dataset released. It is available as the Gigafish dataset on Hugging Face. Loss continued to decrease throughout the entire training run, which strongly suggests that data volume matters.
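To make the idea of training on FEN and evaluation pairs concrete, here is a minimal sketch of how one such pair might be turned into a network input and a regression target. Everything in it is an assumption for illustration and not the author's pipeline: the helper names `fen_to_planes` and `cp_to_target`, the 12-plane one-hot layout, the assumption that evaluations are in centipawns, and the `scale` value of 400 are all hypothetical.

```python
import math

# Plane order is an arbitrary choice for this sketch (white pieces, then black).
PIECES = "PNBRQKpnbrqk"


def fen_to_planes(fen: str) -> list[list[int]]:
    """Encode the piece-placement field of a FEN as 12 one-hot planes of 64 squares.

    Squares follow FEN order: index 0 is a8, index 63 is h1.
    """
    placement = fen.split()[0]
    planes = [[0] * 64 for _ in PIECES]
    square = 0
    for ch in placement:
        if ch == "/":
            continue  # rank separator, carries no square
        if ch.isdigit():
            square += int(ch)  # a digit means that many empty squares
        else:
            planes[PIECES.index(ch)][square] = 1
            square += 1
    if square != 64:
        raise ValueError(f"FEN placement does not cover 64 squares: {placement!r}")
    return planes


def cp_to_target(cp: float, scale: float = 400.0) -> float:
    """Squash a centipawn score into (-1, 1); the scale is a hypothetical choice."""
    return math.tanh(cp / scale)


START_FEN = "rnbqkbnr/pppppppp/8/8/8/8/PPPPPPPP/RNBQKBNR w KQkq - 0 1"
start_planes = fen_to_planes(START_FEN)
start_target = cp_to_target(0.0)
```

Squashing the score keeps very large evaluations from dominating a squared-error loss, which is one common reason to avoid regressing on raw engine scores. A real pipeline would also need to encode side to move, castling rights, and en passant, which this sketch ignores.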