not much happened today (opens in new tab)
**Microsoft** introduced **MAI-Thinking-1**, a **35B parameter MoE model** with **256K context**, achieving **97% on AIME 2025** and outperforming **Sonnet 4.6** in human preference tests. The broader **7-model MAI family** spans reasoning, code, image, speech, and voice, with third-party availability on **OpenRouter**, **fal**, and **Baseten**. The detailed **109-page technical report** revealed insights on scaling, MFU, RL/post-training, and data curation, highlighting no third-party distil...
Read the original article