🪟 Context Windows - saeedesmaili · Scour

When More Documents Hurt RAG: Mitigating Vector Search Dilution with Domain-Scoped, Model-Agnostic Retrieval

🔍Information Retrieval Academic

Less-relevant results

Deep Dive into LLM Token Cost — Blog Series Part 2: How Prompt Caching Actually Works

🧩Cognitive Science Blog

weidongzhou.wordpress.com··Hacker News

Sales Is the Customer Clock

hari.computer··Hacker News

Agentic Search Models with OpenSearch and Elasticsearch

🔍Information Retrieval Blog

bonsai.io··Hacker News

The Wrong Epsilon to the Brain

hari.computer··Hacker News

Show HN: The Deterministic Core Architecture for AI-Augmented Applications

🎮Reinforcement Learning

brandonbellsystems.com··Hacker News

NightFeats @ MMU-RAGent NeurIPS 2025: A Context-Optimized Multi-Agent RAG System for the Text-to-Text Track

🧠LLMs Academic

An interactive introduction to the terrific experience of rendering Arabic and its technical debt

🧠LLMs Blog

lr0.org··Lobsters, Hacker News, Hacker News

GitLab: Built for the agentic engineering era

📞Function Calling Blog

about.gitlab.com··Hacker News

uva-irlab-conv at SemEval-2026 Task 8: Multi-Turn RAG with Learned Sparse Retrieval and Listwise Reranking

🧠LLMs Academic

manavgup/context-analyzer: Context window usage analyzer for Claude Code — MCP server + interactive dashboard

🐍Python Code

github.com··Hacker News

Show HN: AI Boost – an MCP for accessing your everyday patterns

🔍Information Retrieval

ai-boost.io··Hacker News

See what your AI coding agent is doing with Datadog Lapdog

📞Function Calling

chrisebert.net··Hacker News

Bad MCP design cost your Agent 5× more tokens

🧠LLM Inference Discussion

news.ycombinator.com··Hacker News

SIFT: Selective-Index For Fast Compute of RAG Prefill by Exploiting Attention Invariance

🧠LLMs Academic

Context Sculpting

🧩Cognitive Science Blog

perceptiontheory.bearblog.dev··Hacker News

Claude Fable 5 Launches at #1 on the Artificial Analysis Intelligence Index

🪨Obsidian News

artificialanalysis.ai··Hacker News

Ask HN: Is it feasible to run a model on device for complete privacy?

🏠Self-hosting Discussion

news.ycombinator.com··Hacker News

Tail-Aware Adaptive-k: Query-Adaptive Context Selection for Retrieval-Augmented Generation

🧠LLMs Academic

Agentic search - retrieval, harness, or model?

🔍Information Retrieval Blog

softwaredoug.com

··Hacker News

No more posts from saeedesmaili's subscribed feeds.

Scour all 25258 feeds Learn more about Feeds

Sign up or log in to see more results

Log in to enable infinite scrolling