ANN v3: 200ms p99 query latency over 100 billion vectors 💨 Cache Optimization
Our latest ANN release supports scales of 100+ billion vectors in a single search index, with 200ms p99 query latency at 1k QPS and 92% recall.
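For readers unfamiliar with the two headline metrics, here is a minimal sketch of how recall and p99 latency are commonly computed when benchmarking an ANN index. It is not taken from the release itself: the function names `recall_at_k` and `percentile_nearest_rank` are hypothetical, and the nearest-rank percentile definition is an assumption (benchmark tools differ on interpolation).

```python
import math


def recall_at_k(approx_ids, exact_ids):
    """Fraction of the exact top-k neighbors that the ANN result also returned."""
    exact = set(exact_ids)
    if not exact:
        raise ValueError("exact_ids must not be empty")
    return len(exact & set(approx_ids)) / len(exact)


def percentile_nearest_rank(samples, pct):
    """Nearest-rank percentile: smallest sample with at least pct% of samples at or below it."""
    if not samples:
        raise ValueError("samples must not be empty")
    ordered = sorted(samples)
    # Multiply before dividing so integer inputs do not pick up float rounding error.
    rank = math.ceil(pct * len(ordered) / 100)
    return ordered[max(rank, 1) - 1]


if __name__ == "__main__":
    # Hypothetical data: one query's ANN result vs. brute-force ground truth.
    print(recall_at_k([1, 2, 3, 9], [1, 2, 3, 4]))
    # Hypothetical per-query latencies in milliseconds.
    latencies_ms = [10] * 98 + [150, 400]
    print(percentile_nearest_rank(latencies_ms, 99))
```

A reported "92% recall" would then be the average of `recall_at_k` over a set of test queries, and "200ms p99" means 99% of queries completed in 200ms or less under the stated load.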