MedVoiceBias: A Controlled Study of Audio LLM Behavior in Clinical Decision-Making
arxiv.org·6h

Abstract: As large language models transition from text-based interfaces to audio interactions in clinical settings, they might introduce new vulnerabilities through paralinguistic cues in audio. We evaluated these models on 170 clinical cases, each synthesized into speech from 36 distinct voice profiles spanning variations in age, gender, and emotion. Our findings reveal a severe modality bias: surgical recommendations for audio inputs varied by as much as 35% compared to identical text-based inputs, with one model providing 80% fewer recommendations. Further analysis uncovered age disparities of up to 12% between young and elderly voices, which persisted in most models despite c…
