On-Policy [LLM] Distillation (2025) (opens in new tab) 🧠LLMs 5 articles covering this post
On-policy, dense supervision is a useful tool for distillation
Read the original articleOn-policy, dense supervision is a useful tool for distillation
Read the original article