We Asked Grok Build 0.1 to Recover a Secret Hidden in Git History (opens in new tab)
We wanted to test Grok Build 0.1 on a real agentic coding task from one of the world’s most popular coding benchmarks (Terminal-Bench).
Read the original articleWe wanted to test Grok Build 0.1 on a real agentic coding task from one of the world’s most popular coding benchmarks (Terminal-Bench).
Read the original article