Good Learners Think Their Thinking: Generative PRM Makes Large Reasoning Model More Efficient Math Learner
arxiv.org·2d
Can You Trust an LLM with Your Life-Changing Decision? An Investigation into AI High-Stakes Responses
arxiv.org·4d
RLVMR: Reinforcement Learning with Verifiable Meta-Reasoning Rewards for Robust Long-Horizon Agents
arxiv.org·3d
RecUserSim: A Realistic and Diverse User Simulator for Evaluating Conversational Recommender Systems
arxiv.org·2d
CIMR: Contextualized Iterative Multimodal Reasoning for Robust Instruction Following in LVLMs
arxiv.org·3d
Improving annotator selection in Active Learning using a mood and fatigue-aware Recommender System
arxiv.org·2d
Exploring LLM Autoscoring Reliability in Large-Scale Writing Assessments Using Generalizability Theory
arxiv.org·5d
Loading...Loading more...