A review of OpenAI’s o1 and how we evaluate coding agents (opens in new tab)
Testing OpenAI’s newest o1 series of models with Devin for autonomous coding tasks
Read the original articleTesting OpenAI’s newest o1 series of models with Devin for autonomous coding tasks
Read the original article