local LLMs, small LLMs, mixture of experts
RLVMR: Reinforcement Learning with Verifiable Meta-Reasoning Rewards for Robust Long-Horizon Agents
arxiv.org·2d
Improving annotator selection in Active Learning using a mood and fatigue-aware Recommender System
arxiv.org·1d
Explainability Through Systematicity: The Hard Systematicity Challenge for Artificial Intelligence
arxiv.org·2d
DICOM De-Identification via Hybrid AI and Rule-Based Framework for Scalable, Uncertainty-Aware Redaction
arxiv.org·1d
Loading...Loading more...