Why Weibo’s tiny VibeThinker-3B has the AI world arguing over benchmarks again (opens in new tab)
On Sunday, a team of nine researchers at — the Chinese social media giant better known for its microblogging platform than for cutting-edge artificial intelligence — quietly posted a to arXiv that sent shockwaves through the AI research community. Their claim: a language model with just 3 billion parameters can match or exceed the reasoning performance of flagship systems from , The model, called — the American Invitational Mathematics Examination, one of the most demanding standardized math ...
Read the original article