SHIFT: Semantic Harmonization via Index-side Feature Transformation for Multilingual Information Retrieval
With the rapid expansion of massive multilingual corpora, Multilingual Information Retrieval (MLIR) has emerged as a critical technology for global information access. MLIR enables users to retrieve semantically relevant documents from multilingual text collections using a single-language query. However, recent multilingual dense retrieval models often exhibit a strong preference for documents in the same language as the query. This leads to sev...