Undertrained Tokens in DeepSeek R1
tokencontributions.substack.com·3d·
Discuss: Substack