Proof Assistants, Interactive Verification, Proof Search, Tactical Reasoning
KeyKnowledgeRAG (K^2RAG): An Enhanced RAG method for improved LLM question-answering capabilities
arxiv.org·12h
SpatialViz-Bench: Automatically Generated Spatial Visualization Reasoning Tasks for MLLMs
arxiv.org·12h
SQLBarber: A System Leveraging Large Language Models to Generate Customized and Realistic SQL Workloads
arxiv.org·2d
Medical Red Teaming Protocol of Language Models: On the Importance of User Perspectives in Healthcare Settings
arxiv.org·12h