Alignment Research, Model Robustness, Adversarial Examples, Risk Assessment
Black-Box Test Code Fault Localization Driven by Large Language Models and Execution Estimation
arxiv.orgยท20h
Angio-Diff: Learning a Self-Supervised Adversarial Diffusion Model for Angiographic Geometry Generation
arxiv.orgยท20h
SRFT: A Single-Stage Method with Supervised and Reinforcement Fine-Tuning for Reasoning
arxiv.orgยท20h
Toward Environmentally Equitable AI
cacm.acm.orgยท9h
Signal Use and Emergent Cooperation
arxiv.orgยท20h
MOSCARD -- Causal Reasoning and De-confounding for Multimodal Opportunistic Screening of Cardiovascular Adverse Events
arxiv.orgยท20h
Visual hallucination detection in large vision-language models via evidential conflict
arxiv.orgยท20h
Temporal-IRL: Modeling Port Congestion and Berth Scheduling with Inverse Reinforcement Learning
arxiv.orgยท20h
From 10 to 10,000 Users: The AI Agent Scaling Playbook
pub.towardsai.netยท12h
GLIMPSE: Gradient-Layer Importance Mapping for Prompted Visual Saliency Explanation for Generative LVLMs
arxiv.orgยท20h
Loading...Loading more...