The Semantic Illusion: Certified Limits of Embedding-Based Hallucination Detection in RAG Systems
arxiv.org·18h
🔍RAG
Preview
Report Post

View PDF HTML (experimental)

Abstract:Retrieval-Augmented Generation (RAG) systems remain susceptible to hallucinations despite grounding in retrieved evidence. Current detection methods rely on semantic similarity and natural language inference (NLI), but their fundamental limitations have not been rigorously characterized. We apply conformal prediction to hallucination detection, providing finite-sample coverage guarantees that enable precise quantification of detection capabilities. Using calibration sets of approximately 600 examples, we achieve 94% coverage with 0% false positive rate on synthetic hallucinations (Natural Questions). However, on three real hallucination benchmarks spanning multipl…

Similar Posts

Loading similar posts...