Skill Distillation (opens in new tab)
I’ve been using state-of-the-art models to teach small models running on my computer how I work. My personal agent, based on The first layer is ~/memories. Before answering any procedural question, the agent searches QMD for the right playbook. The second layer is Skills, atomic SKILL.md files that describe one job each. The skills are written by a frontier model. So are the evaluations that grade them. The same system writes, tests, and rewrites each skill until accuracy converges. It also c...
Read the original article