TLA+, Model Checking, Coq, Theorem Proving, Specification Languages
The Prompt Foreman
dbreunig.com·13h
A Hierarchical and Evolvable Benchmark for Fine-Grained Code Instruction Following with Multi-Turn Feedback
arxiv.org·5d