MiMo-v2.5-Pro-UltraSpeed: 1T model with 1000 TPS (opens in new tab) Content type: Blog 8 articles covering this post
MiMo, in collaboration with TileRT, releases the UltraSpeed mode of Xiaomi MiMo-V2.5-Pro — breaking 1000 tokens/s generation speed on a 1T-parameter model for the first time on commodity GPUs through extreme model-system codesign.
Read the original article