FIBER: A Multilingual Evaluation Resource for Factual Inference Bias
arxiv.org·1d


Abstract: Large language models are widely used across domains, yet there are concerns about their factual reliability and biases. Factual knowledge probing offers a systematic means to evaluate these aspects. Most existing benchmarks focus on single-entity facts and monolingual data. We therefore present FIBER, a multilingual benchmark for evaluating factual knowledge in single- and multi-entity settings. The dataset includes sentence completion, question-answering, and object-count prediction tasks in English, Italian, and Turkish. Using FIBER, we examine whether the prompt language induces inference bias in entity selection and how large language models perform on multi-entit…
