LongCat-Flash-Thinking, LLM from Meituan (China's Equivalent of Uber Eats)
github.com·5h·
Discuss: Hacker News

LongCat-Flash-Thinking


Tech Report 📄

Model Introduction

We introduce and release LongCat-Flash-Thinking, which is a powerful and efficient large reasoning model (LRM) with 560 billion total parameters, featuring an innovative Mixture-of-Experts (MoE) architecture. The model incorporates a dynamic computation mechanism that activates 18.6B∼31.3B parameters (averaging∼27B) based on contextual demands, optimizing both computational efficiency and performance. LongCat-Flash-Thinking is developed by our DORA system, which is an efficient distributed RL framework that supports asynchronous training and flexible accelerator usage to ensure stability and efficiency. Our comprehensiv…

Similar Posts

Loading similar posts...