Retrieval-augmented generation

IA-RAG: Interval-Algebra-Driven Temporal Reasoning for Dynamic Knowledge Retrieval

Large Language Models (LLMs)
Content type: Academic
arxiv.org

SIFT: Selective-Index For Fast Compute of RAG Prefill by Exploiting Attention Invariance

Prompt optimizations for LLM serving
Content type: Academic
arxiv.org

Energy-Efficient On-Device RAG on a Mobile NPU: System Design and Benchmark on Snapdragon X Elite

AI Performance Profiling
Content type: Academic
arxiv.org

When More Documents Hurt RAG: Mitigating Vector Search Dilution with Domain-Scoped, Model-Agnostic Retrieval

Large Language Models (LLMs)
Content type: Academic
arxiv.org

Document-Authored Control-Signal Impersonation: A Low-Cost Indirect Prompt Attack on RAG Safety Boundaries

Prompt optimizations for LLM serving
Content type: Academic
arxiv.org

TA-RAG: Tone-Aware Retrieval-Augmented Generation for Peer-Support Health Communication

Large Language Models (LLMs)
Content type: Academic
arxiv.org

Tail-Aware Adaptive-k: Query-Adaptive Context Selection for Retrieval-Augmented Generation

Large Language Models (LLMs)
Content type: Academic
arxiv.org

Anything2Skill: Compiling External Knowledge into Reusable Skills for Agents

Agents using LLMs
Content type: Academic
arxiv.org

When Poison Fails After Retrieval: Revisiting Corpus Poisoning under Chunking and Reranking Pipelines

Large Language Models (LLMs)
Content type: Academic
arxiv.org

MolE-RAG: Molecular Structure-Enhanced Retrieval-Augmented Generation for Chemistry

Large Language Models (LLMs)
Content type: Academic
arxiv.org

EverydayGPT: Confidence-Gated Routing for Efficient and Safe Hybrid GPT-RAG Conversational QA

Large Language Models (LLMs)
Content type: Academic
arxiv.org

TICoder: A Repository-Level Code Generation Framework with Test-Driven Planning and Implementation-Aware Reuse

Large Language Models (LLMs)
Content type: Academic
arxiv.org

Reducing Hallucinations in Complex Question Answering using Simple Graph-based Retrieval-Augmented Generation (long version)

Large Language Models (LLMs)
Content type: Academic
arxiv.org

TrustMargin: Training-Free Arbitration between Parametric Memory and Retrieved Evidence in Large Language Models

Large Language Models (LLMs)
Content type: Academic
arxiv.org

NightFeats @ MMU-RAGent NeurIPS 2025: A Context-Optimized Multi-Agent RAG System for the Text-to-Text Track

Large Language Models (LLMs)
Content type: Academic
arxiv.org

Beyond Probabilistic Similarity: Structural, Temporal, and Causal Limitations of Retrieval-Augmented Generation in the Legal Domain

Large Language Models (LLMs)
Content type: Academic
arxiv.org

QCFuse: Query-Aware Cache Fusion via Compressed View for Efficient RAG Serving

Systems-level optimizations for LLM serving
Content type: Academic
arxiv.org

uva-irlab-conv at SemEval-2026 Task 8: Multi-Turn RAG with Learned Sparse Retrieval and Listwise Reranking

Large Language Models (LLMs)
Content type: Academic
arxiv.org

LongRTL: Graph-Similarity-Guided LLM-driven Long Context RTL Optimization

Systems-level optimizations for LLM serving
Content type: Academic
arxiv.org

The Structural Attention Tax: How Retrieval Format Hijacks In-Context Learning Independent of Content

Large Language Models (LLMs)
Content type: Academic
arxiv.org
