Locality Sensitive Hashing, Jaccard Similarity, Duplicate Detection, Document Clustering

Efficient and accurate search in petabase-scale sequence repositories
nature.comยท2dยท
Discuss: Hacker News
๐Ÿ”„Burrows-Wheeler
Indexing, Hashing
dev.toยท1dยท
Discuss: DEV
๐Ÿš€Query Optimization
Nearest Neighbor CCP-Based Molecular Sequence Analysis
arxiv.orgยท15h
๐Ÿ”„Burrows-Wheeler
An enough week
blog.mitrichev.chยท23hยท
๐ŸงฎZ3 Solver
DupeGuru lets you quickly find and remove duplicate files from your drives
techspot.comยท1d
๐Ÿ”„Content Deduplication
Homomorphism Problems in Graph Databases and Automatic Structures
arxiv.orgยท15h
๐Ÿ”—Graph Isomorphism
Mind the Gap: Quantifying Vocabulary Mismatch in E-Commerce Site Search
searchhub.ioยท1dยท
Discuss: Hacker News
๐Ÿ“ˆSearch Quality
Automated Copyright Infringement Detection via Semantic Fingerprinting and Dynamic Thresholding
dev.toยท1dยท
Discuss: DEV
๐Ÿ‘๏ธPerceptual Hashing
Fast-Convergent Proximity Graphs for Approximate Nearest Neighbor Search
arxiv.orgยท2d
๐Ÿ“Range Queries
[R] DeepSeek 3.2's sparse attention mechanism
reddit.comยท15hยท
๐ŸŒ€Brotli Internals
Explicit Lossless Vertex Expanders!
gilkalai.wordpress.comยท9h
๐Ÿ’ŽInformation Crystallography
Sorting encrypted data without decryption: a practical trick
dev.toยท4hยท
Discuss: DEV
๐Ÿ”Hash Functions
Parameterized Complexity of s-Club Cluster Edge Deletion
arxiv.orgยท1d
๐ŸงฎKolmogorov Complexity
RND1: Simple, Scalable AR-to-Diffusion Conversion
radicalnumerics.aiยท23hยท
Discuss: Hacker News
๐Ÿ’ปLocal LLMs
MetaGraph: Scalable annotated de Bruijn graphs for DNA indexing and alignment
github.comยท1dยท
Discuss: Hacker News
๐Ÿ”„Burrows-Wheeler
MARC: Memory-Augmented RL Token Compression for Efficient Video Understanding
arxiv.orgยท15h
๐Ÿง Learned Codecs
Why Your Simple Password Is a Mathematical Catastrophe
tawandamunongo.devยท1dยท
Discuss: Hacker News
๐Ÿ”Hash Functions
Relational Database Distillation: From Structured Tables to Condensed Graph Data
arxiv.orgยท1d
๐Ÿ“ŠGraph Databases
Writing regex is pure joy. You can't convince me otherwise.
triangulatedexistence.mataroa.blogยท17hยท
โœ…Format Verification