Idris, Agda, Proof Assistants, Type-Level Programming
Institutional Policy Pathways for Supporting Research Software: Global Trends and Local Practices
arxiv.org·4h
RoleConflictBench: A Benchmark of Role Conflict Scenarios for Evaluating LLMs' Contextual Sensitivity
arxiv.org·4h
Knowledge-Level Consistency Reinforcement Learning: Dual-Fact Alignment for Long-Form Factuality
arxiv.org·1d
Loading...Loading more...