Risk-centered benchmarking of large language models for AI-enabled counseling in chronic autoimmune thyroid eye disease (opens in new tab)
BackgroundThyroid eye disease (TED) is a chronic autoimmune inflammatory orbital disease requiring activity assessment, risk stratification, and triage. As patients increasingly consult large language models (LLMs), evidence on their quality and safety for TED counseling remains limited.MethodsWe conducted a cross-sectional benchmark using a prespecified 35-question Chinese TED counseling bank covering symptom recognition, activity assessment, treatment, daily management, follow-up, and care-...
Read the original article