RexRerankers: SOTA Rankers for Product Discovery and AI Assistants (opens in new tab)

TL;DR

We introduce RexRerankers, a family of state-of-the-art rerankers that estimate how relevant an e-commerce product is for a given query. We open-source Amazebay, a large-scale dataset collection for training and evaluating product relevance models:

  • Amazebay-Catalog: product metadata for 37M items across categories
  • Amazebay-Relevance: 6M query–product pairs with graded relevance scores, covering ~364k unique queries and ~3M products

For holistic evaluation of product discovery rerankers, we also release ERESS (E-commerce Relevance Evaluation Scoring Suite): 4.7k unique queries and 72k labeled query–product pairs designed to reflect real shopping search behavior.

Finally, we open-source a training recipe for efficient, high-performing rankers using a Distributional-Pointwise Loss that treats annotation noise as signal rather than purely as error-improving robustness and calibration in real-world relevance modeling.

Introduction

Search in modern systems is a multi-stage decision pipeline optimized for speed, relevance, and user satisfaction. Whether you’re building web search, enterprise search, or product search, the dominant architecture is:

  • Candidate generation (retrieval): quickly fetch a few hundred to a few thousand potentially relevant items from millions
  • Reranking: apply a stronger model to reorder those candidates by relevance
  • Post-processing & business logic: enforce constraints (availability, compliance, diversity), personalize, and format results

E-commerce search looks like "search" but the definition of relevance is richer and more constrained. A product can match the query text and still be a bad result due to:

  • Variant and attribute mismatch: size, color, material, compatibility, fit
  • Category intent: "running shoes" vs "shoe laces," "sofa" vs "sofa cover"
  • Brand sensitivity: explicit ("Nike"), implicit ("Apple charger"), or excluded ("no ads," "non-branded")
  • Query language is messy: shorthand, typos, multi-intent queries, and colloquial attributes ("work bag that fits 16 inch laptop")

RexRerankers were built for this modern product discovery setting: high-recall retrieval + strong reranking, optimized for e-commerce semantics. The goal is to make reranking models that are:

  • Accurate on fine-grained product relevance
  • Robust to noisy or ambiguous supervision
  • Practical to deploy with latency and cost constraints
  • Capable of handling indirect utility queries

Data Curation

Loading more...

Keyboard Shortcuts

Navigation
Next / previous item
j/k
Open post
oorEnter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help