Proof Assistants, Interactive Verification, Proof Search, Tactical Reasoning
Mind the Gap: Evaluating Model- and Agentic-Level Vulnerabilities in LLMs with Action Graphs
arxiv.org·6d
Loading...Loading more...
Proof Assistants, Interactive Verification, Proof Search, Tactical Reasoning