Incentives or Ontology? A Structural Rebuttal to OpenAI's Hallucination Thesis
arxiv.org·1d
🎭Anthropic Claude
Preview
Report Post

View PDF

Abstract:OpenAI has recently argued that hallucinations in large language models result primarily from misaligned evaluation incentives that reward confident guessing rather than epistemic humility. On this view, hallucination is a contingent behavioral artifact, remediable through improved benchmarks and reward structures. In this paper, we challenge that interpretation. Drawing on previous work on structural hallucination and empirical experiments using a Licensing Oracle, we argue that hallucination is not an optimization failure but an architectural inevitability of the transformer model. Transformers do not represent the world; they model statistical associations among tokens. Their embedding spaces form a pseudo-ontology derived …

Similar Posts

Loading similar posts...