CP-Env: Evaluating Large Language Models on Clinical Pathways in a Controllable Hospital Environment
arxiv.org·3d
Effect Handlers
Preview
Report Post

View PDF HTML (experimental)

Abstract:Medical care follows complex clinical pathways that extend beyond isolated physician-patient encounters, emphasizing decision-making and transitions between different stages. Current benchmarks focusing on static exams or isolated dialogues inadequately evaluate large language models (LLMs) in dynamic clinical scenarios. We introduce CP-Env, a controllable agentic hospital environment designed to evaluate LLMs across end-to-end clinical pathways. CP-Env simulates a hospital ecosystem with patient and physician agents, constructing scenarios ranging from triage and specialist consultation to diagnostic testing and multidisciplinary team meetings for agent interaction. …

Similar Posts

Loading similar posts...