Powering Billion-Scale Vector Search with OpenSearch
uber.com·16h
🏗️Search Infrastructure
Preview
Report Post

Share

Introduction

At Uber, our systems handle massive amounts of data daily, from ridesharing to delivery. We’ve traditionally used keyword-based search with Apache Lucene™. However, we needed to move beyond simple keyword matching to semantic search to understand the meaning behind searches.

To achieve this, we adopted Amazon® OpenSearch as our vector search engine. Its scalability, performance, and flexibility were key factors in our decision. This blog post explores our journey of evaluating and implementing OpenSearch for large-scale vector search, focusing specifically on the infrastructure challenges and solutions we encountered.

Why OpenSearch?

Our infrastructure for semantic search began with Apache Lucene and its HNSW (Hierarchical Navigable Small World) algori…

Similar Posts

Loading similar posts...