Locality Sensitive Hashing, Jaccard Similarity, Duplicate Detection, Document Clustering

Feeds to Scour
SubscribedAll
Scoured 16050 posts in 190.1 ms
Beyond the Geometric Curse: High-Dimensional N-Gram Hashing for Dense Retrieval
arxiv.org·2d
📐Geometric Hashing
Preview
Report Post
Visualizing K-Way Merge: An Interactive Guide to Database Sorting
justinhj.github.io·21h·
🌲B-tree Variants
Preview
Report Post
Dynamic Pattern Matching with Wildcards
arxiv.org·18h
🔤Morris-Pratt
Preview
Report Post
RexRerankers: SOTA Rankers for Product Discovery and AI Assistants
huggingface.co·8h·
Discuss: r/LocalLLaMA
🔍Information Retrieval
Preview
Report Post
RAGStack-Lambda: Scale-to-Zero RAG with Multimodal Search
dev.to·1d·
Discuss: DEV
📊Multi-vector RAG
Preview
Report Post
DASL: Big DASL (BDASL)
dasl.ing·19h
📋DFDL
Preview
Report Post
Data Structures and Algorithms
tech.stonecharioteer.com·1d·
Discuss: Hacker News
📼Tape Combinators
Preview
Report Post
A tiny tool to extract data from any website
superdevpro.com·11h·
Discuss: Hacker News
🕵️Feed Discovery
Preview
Report Post
Baby Shazam: Reverse Audio Search Engine with Qdrant + Discover Similar
pub.towardsai.net·17h
💿FLAC Archaeology
Preview
Report Post
Finding Related Items (2011)
bentilly.blogspot.com·8h·
Discuss: Hacker News
🌳B-tree Archaeology
Preview
Report Post
Mahdi Shamlou | Solving LeetCode #1: Two Sum — The Classic Hash Map Solution
dev.to·1d·
Discuss: DEV
🎯Performance Proofs
Preview
Report Post
Speculative Decoding Is Not a Heuristic
reedmeyerson.com·2d·
Discuss: Hacker News
🌸Bloom Variants
Preview
Report Post
Introduction to PostgreSQL Indexes
dlt.github.io·1d·
Discuss: r/programming
🗄️Database Internals
Preview
Report Post
Functional Optics for Modern Java
blog.scottlogic.com·1d
💧Liquid Types
Preview
Report Post
2026-01-22: Paper Summary: "Towards a better QA process: Automatic detection of quality problems in archived websites using visual comparisons"
ws-dl.blogspot.com·2d·
🧪Archive Fuzzing
Preview
Report Post
Enhancing link prediction in biomedical knowledge graphs with BioPathNet
nature.com·1d·
Discuss: Hacker News
🕸️Graph Embeddings
Preview
Report Post
A crowdsourced repository for optimization constants?
terrytao.wordpress.com·1d·
🎯Performance Proofs
Preview
Report Post
Scalability! But at what COST?
frankmcsherry.org·1d·
Discuss: Hacker News
🔗Topological Sorting
Preview
Report Post
Finding Prime Clusters
johndcook.com·6d
🔍MIN Hash
Preview
Report Post
How Static Analysis Can Expose Personal Data Hidden in Source Code
hackernoon.com·3d
📊Static Analysis
Preview
Report Post

Keyboard Shortcuts

Navigation
Next / previous item
j/k
Open post
oorEnter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help