Undertrained Tokens in DeepSeek R1
tokencontributions.substack.comΒ·15hΒ·
Discuss: Substack