Computational Turing Test Reveals Systematic Differences Between Human and AI Language
arxiv.org

Abstract: Large language models (LLMs) are increasingly used in the social sciences to simulate human behavior, based on the assumption that they can generate realistic, human-like text. Yet this assumption remains largely untested. Existing validation efforts rely heavily on human-judgment-based evaluations – testing whether humans can distinguish AI from human output – despite evidence that such judgments are blunt and unreliable. As a result, the field lacks robust tools for assessing the realism of LLM-generated text or for calibrating models to real-world data. This paper makes two contributions. First, we introduce a computational Turing test: a validation framework th…
