How we built a persistent agent memory layer on Elasticsearch with 0.89 recall and zero tenant leaks
Discover the architecture behind a persistent, multi-tenant agent memory layer on Elasticsearch: three indices, hybrid retrieval with reciprocal rank fusion (RRF) and a reranker, supersession, decay, and per-user isolation via document-level security (DLS). Recall@10 of 0.89 across 168 questions. Full open-source implementation included.