Higher-order Logic, Proof Development, Mathematical Foundations, Interactive Verification
Can We Terraform Our Way Out of Earth?
hackernoon.com·13h
ChessArena: A Chess Testbed for Evaluating Strategic Reasoning Capabilities of Large Language Models
arxiv.org·1d
TokenSwap: Backdoor Attack on the Compositional Understanding of Large Vision-Language Models
arxiv.org·1d