RecUserSim: A Realistic and Diverse User Simulator for Evaluating Conversational Recommender Systems
arxiv.org·2d
RLVMR: Reinforcement Learning with Verifiable Meta-Reasoning Rewards for Robust Long-Horizon Agents
arxiv.org·3d
Vulnerability Mitigation System (VMS): LLM Agent and Evaluation Framework for Autonomous Penetration Testing
arxiv.org·4d