Information Retrieval

Understanding HNSW: The Engine Behind Fast Vector Search

 🔢Embeddings
chimchim89.github.io

What Limits Does Quantization Place on Dense Top-$k$ Retrieval? A Theoretical Study

 🔢Embeddings  Content type: Academic
arxiv.org

HK101-cyber/soc-home-lab: Enterprise SOC home lab, ELK Stack SIEM, Splunk, Wazuh XDR. Detection rules, threat hunting, attack simulations, dashboards.

 🖥️Homelab  Content type: Code
github.com · r/homelab

The AI Agents Stack (2026 Edition)

 🪟Context Windows  Content type: Blog
oreilly.com

TrustMargin: Training-Free Arbitration between Parametric Memory and Retrieved Evidence in Large Language Models

 🧠LLMs  Content type: Academic
arxiv.org

Replica management: Inside the system that keeps Elasticsearch Serverless searches fast at scale

 🏠Self-Hosting  Content type: Blog
elastic.co

STORM: Stepwise Token Optimization with Reward-Guided Beam Search

 🔤Tokenization  Content type: Academic
arxiv.org

DeytaHQ/khora: Library for creating knowledge repositories from multi-source data and exposing a single query substrate

 🤖AI Agents  Content type: Code
github.com · Hacker News

CompRank: Efficient LLM Reranking via Token-Level Compression and Decoding-Free Scoring

 🪟Context Windows  Content type: Academic
arxiv.org

HNSW vs LSH: How Elasticsearch hits 0.99 recall@10 at 15,000 QPS — and what it costs

 🔢Embeddings  Content type: Blog
elastic.co

The Structural Attention Tax: How Retrieval Format Hijacks In-Context Learning Independent of Content

 🪟Context Windows  Content type: Academic
arxiv.org

Elasticsearch simdvec deep-dive: Walking the memory tightrope to 2x better vector throughput

 🔢Embeddings  Content type: Blog
elastic.co

Decoupling Semantics and Logic: A Training-Free Coarse-to-Fine Pipeline for Video Retrieval-Augmented Generation

 🪟Context Windows  Content type: Academic
arxiv.org

hanxiao/omni-macos: Native macOS semantic search over your local files - text, images, audio, video in one vector space, on-device on Apple silicon.

 🔢Embeddings  Content type: Code
github.com · Hacker News

Retrieval Augmented Generation Framework for the Nepali Legal Domain Question Answering

 🪟Context Windows  Content type: Academic
arxiv.org

SkillResolve-Bench: Measuring and Resolving Same-Capability Ambiguity in Agent Skill Retrieval

 🤖AI Agents  Content type: Academic
arxiv.org

paradedb/drizzle-paradedb: Official extension to Drizzle for use with ParadeDB

 🪟Context Windows  Content type: Code

The Clock Said Valid. The World Said Otherwise. *CLAIM-24 update — Self-Correcting Systems series*

 🧠LLM Inference  Content type: Code
github.com · DEV

Multilingual Fact-Checking at Scale: Fine-Tuned Compact Models vs LLMs

 🧠Transformers  Content type: Academic
arxiv.org

gitx64/Multi-lab-environment: A simple configured multilab environment built with disposable but data reliable containers setup

 🏠Self-Hosting  Content type: Code
github.com · r/sysadmin