Evaluating Kimi 2.5 vs Kimi 2.6: What happens to agent skills when the model gets smarter?

Covers Show HN: A package manager for agent skills with built-in evals. Discussed on DEV.

When a stronger model ships, there are two questions every skill author should want answered, and evals are the only honest way to answer either:

Which skills just got absorbed? A model that now knows how to do X natively does not need a skill telling it to do X. Fewer skills to maintain, leaner context, lower cost.

Which skills still matter? Behaviour-level guidance (conventions, preferences, project-specific workflows) is not something pretraining will fill in for you. Those skills should k...
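One way to make the two questions concrete is to run each skill's evals on the new model twice, once with the skill loaded and once without, and compare pass rates. The following is a minimal sketch of that comparison, not the article's actual tooling: `SkillResult`, `classify`, the thresholds, the skill names, and the pass rates are all hypothetical placeholders chosen for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SkillResult:
    """Eval pass rates for one skill on the new model (hypothetical shape)."""
    skill: str
    pass_rate_without: float  # skill not loaded into context
    pass_rate_with: float     # skill loaded into context


def classify(result: SkillResult,
             min_uplift: float = 0.05,
             absorbed_floor: float = 0.90) -> str:
    """Label a skill from its with/without pass rates.

    Both thresholds are assumptions, not values from the article.
    """
    uplift = result.pass_rate_with - result.pass_rate_without
    if uplift >= min_uplift:
        # The skill still changes outcomes, so it is worth keeping.
        return "still matters"
    if result.pass_rate_without >= absorbed_floor:
        # The model passes without help, so the skill is a removal candidate.
        return "absorbed"
    # Low pass rate either way: the skill or its evals need a closer look.
    return "inconclusive"


# Placeholder numbers for illustration only, not measured results.
results = [
    SkillResult("write-sql-migrations", 0.95, 0.96),
    SkillResult("team-commit-conventions", 0.40, 0.90),
    SkillResult("flaky-task", 0.50, 0.52),
]

labels = {r.skill: classify(r) for r in results}
for skill, label in labels.items():
    print(f"{skill}: {label}")
```

The "inconclusive" bucket is deliberate: a skill that neither helps nor is unnecessary usually points at a weak eval or a badly written skill, and removing it on that evidence alone would be premature.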
