not much happened today | AINews
**Prime Intellect's `prime-rl` v0.6.0** advances agentic reinforcement learning infrastructure supporting **1-trillion-parameter MoE models** with sub-5-minute step times and a **131k-context GLM-5 agentic setup**. The release includes optimizations in inference, training, and rollout orchestration, and supports models such as **GLM-5, Kimi, and Nemotron**. **Anthropic's Claude Tag** exemplifies the shift to persistent, asynchronous agents embedded in organizations, already writing **65% of the product...