[1908.10084] Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks

Covered by 3 sources including Towards Data Science, hackernoon.com

BERT (Devlin et al., 2018) and RoBERTa (Liu et al., 2019) have set a new state of the art on sentence-pair regression tasks such as semantic textual similarity (STS). However, both require that the two sentences be fed into the network together, which causes a massive computational overhead: finding the most similar pair in a collection of 10,000 sentences requires about 50 million inference computations (~65 hours) with BERT. The construction of BERT makes it unsuitable for semantic similarity s...
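
The "about 50 million" figure follows from counting unordered sentence pairs, n(n-1)/2. The sketch below reproduces that count for n = 10,000 and backs out the per-pair inference time implied by the quoted ~65 hours. The per-pair time is derived here for illustration and is not stated in the abstract; the function name `pair_count` is likewise an assumption of this sketch.

```python
def pair_count(n: int) -> int:
    """Number of unordered pairs among n items: n * (n - 1) / 2."""
    return n * (n - 1) // 2


n_sentences = 10_000
pairs = pair_count(n_sentences)  # one cross-encoder pass per pair

# Figure quoted in the abstract: roughly 65 hours for the full search.
quoted_hours = 65
total_seconds = quoted_hours * 3600

# Derived (not from the source): implied average time per pair.
implied_seconds_per_pair = total_seconds / pairs

print(f"pairs to score: {pairs:,}")
print(f"implied time per pair: {implied_seconds_per_pair * 1000:.2f} ms")
```

Because the pair count grows quadratically, doubling the collection to 20,000 sentences roughly quadruples the number of forward passes, which is why scoring every pair jointly does not scale to large collections.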

Covered in 3 articles

LLM Evals Are Based on Vibes — I Built the Missing Layer That Decides What Ships

Why Modern Search Systems Use Both Dual-Encoders and Cross-Encoders

Evolutionary Data Making – How to train embedding models