Locality Sensitive Hashing, Jaccard Similarity, Duplicate Detection, Document Clustering

Graph-Augmented Hybrid Retrieval and Multi-Stage Re-ranking: A Framework for High-Fidelity Chunk Retrieval in RAG Systems
dev.toยท9hยท
Discuss: DEV
๐ŸŽฏRetrieval Systems
Building tenets: Intelligent context aggregation for AI pair programming
jddunn.github.ioยท6hยท
Discuss: Hacker News
๐ŸŒ€Brotli Internals
Deep Lookup Network
arxiv.orgยท19h
๐ŸงฎVector Embeddings
BM25F from scratch
softwaredoug.comยท23h
๐Ÿ”Information Retrieval
Learning languages with the help of algorithms
johndcook.comยท1dยท
Discuss: Hacker News
๐ŸงฎKolmogorov Complexity
RAG Explained: Understanding Embeddings, Similarity, and Retrieval
towardsdatascience.comยท1d
๐Ÿ“ŠMulti-vector RAG
2025-09-17: Classic Machine Learning Models and XAI Methods
ws-dl.blogspot.comยท1dยท
๐Ÿง Machine Learning
Facebook Research releases MapAnything, 3D reconstruction from images
github.comยท13hยท
Discuss: Hacker News
๐ŸบComputational Archaeology
GTA -- An ATSP Method: Shifting the Bottleneck from Algorithm to RAM
arxiv.orgยท19h
๐Ÿš€SIMD Text Processing
How I hacked the Placement portal of my college to leak the entire SQL database
infosecwriteups.comยท15h
๐Ÿ—„๏ธDatabase Internals
Rendezvous Hashing Explained (2020)
randorithms.comยท3dยท
๐ŸŒDistributed Hash
Generic functional parallel algorithms: scan and FFT (2017)
dl.acm.orgยท11hยท
Discuss: Hacker News
๐Ÿ”—Functional Compilers
Tractability Frontiers of the Shapley Value for Aggregate Conjunctive Queries
arxiv.orgยท19h
๐Ÿง Query Planners
BM25F from Scratch
softwaredoug.comยท5hยท
Discuss: Hacker News
๐Ÿ”Information Retrieval
Automated Semantic Drift Detection and Mitigation in Real-Time Multimodal Data Streams
dev.toยท5hยท
Discuss: DEV
๐ŸŒŠStream Processing
Implementing a Logical Inference System for Japanese Comparatives
arxiv.orgยท19h
๐ŸงฎProlog Parsing
StringWa.rs on GPUs: Databases & Bioinformatics ๐Ÿฆ 
ashvardanian.comยท3dยท
๐Ÿ”„Burrows-Wheeler
Vectorization in Python for Machine Learning
dev.toยท2dยท
Discuss: DEV
โšกSIMD Vectorization
Faster, more memory-efficient performance in Grafana Mimir
grafana.comยท17hยท
Discuss: Hacker News
๐Ÿ—„๏ธDatabase Internals