Scalable In-context Ranking with Generative Models
arxiv.org·6d·
Discuss: Hacker News

Title:Scalable In-context Ranking with Generative Models

View PDF HTML (experimental)

Abstract:In-context Ranking (ICR) is an emerging paradigm for Information Retrieval (IR), which leverages contextual understanding of LLMs by directly incorporating the task description, candidate documents, and the query into the model’s input prompt and tasking the LLM to identify relevant document(s). While it is effective, efficiency is a significant challenge in this paradigm, especially as the candidate list grows due to quadratic/super-linear scaling of attention operation with context length. To this end, this paper first identifies inherent and exploitable structures in the attention of LLMs finetuned for …

Similar Posts

Loading similar posts...