SysMoBench: Evaluating AI on Formally Modeling Complex Real-World Systems (opens in new tab)
This paper presents SysMoBench, a benchmark designed to evaluate generative AI's ability to formally model complex concurrent and distribute...
Read the original article