Improving annotator selection in Active Learning using a mood and fatigue-aware Recommender System
arxiv.org·3d
RLVMR: Reinforcement Learning with Verifiable Meta-Reasoning Rewards for Robust Long-Horizon Agents
arxiv.org·4d
MemTool: Optimizing Short-Term Memory Management for Dynamic Tool Calling in LLM Agent Multi-Turn Conversations
arxiv.org·5d
Exploring LLM Autoscoring Reliability in Large-Scale Writing Assessments Using Generalizability Theory
arxiv.org·6d
Spec-VLA: Speculative Decoding for Vision-Language-Action Models with Relaxed Acceptance
arxiv.org·4d
SigBERT: Combining Narrative Medical Reports and Rough Path Signature Theory for Survival Risk Estimation in Oncology
arxiv.org·3d