Locality Sensitive Hashing, Jaccard Similarity, Duplicate Detection, Document Clustering

Fluent Visitors: revisiting a classic design pattern
neilmadden.blog·14h
Discuss: r/programming
λLambda Formalization
The Constrained Application Protocol (CoAP)
datatracker.ietf.org·1d
Discuss: Hacker News
🌐NetworkProtocols
Building blobd: single-machine object store with sub-millisecond reads and 15 GB/s uploads
blog.wilsonl.in·2d
🗃️Database Storage
My First Multi-GPU Kernel: Writing All-to-All for AMD MI300X
gau-nernst.github.io·2d
Discuss: Hacker News
Homebrew CPUs
Casing Collar Identification using AlexNet-based Neural Networks for Depth Measurement in Oil and Gas Wells
arxiv.org·1d
🧠Machine Learning
BeetleFlow: An Integrative Deep Learning Pipeline for Beetle Image Processing
arxiv.org·1d
🌊Streaming Algorithms
A Unified Model for Human Mobility Generation in Natural Disasters
arxiv.org·6h
🔲Cellular Automata
Spatial Secrets: Unlocking Hidden Patterns with Language Models
dev.to·1d
Discuss: DEV
🧮Kolmogorov Complexity
Reversal Invariance in Autoregressive Language Models
arxiv.org·1d
🤖Grammar Induction
Text-guided Fine-Grained Video Anomaly Detection
arxiv.org·1d
🤖Advanced OCR
Causal Graph Neural Networks for Healthcare
arxiv.org·6h
🌀Riemannian Computing
Why Multimodal AI Broke the Data Pipeline — And How Daft Is Beating Ray and Spark to Fix It
hackernoon.com·2d
🌊Streaming Algorithms
Process Bottleneck Breakthrough: AI-Powered Outcome Prediction
dev.to·8h
Discuss: DEV
🌊Stream Processing
The Collaboration Gap
arxiv.org·6h
🧠Intelligence Compression
Contrastive Knowledge Transfer and Robust Optimization for Secure Alignment of Large Language Models
arxiv.org·2d
💻Local LLMs
Autobiasing Event Cameras for Flickering Mitigation
arxiv.org·6h
🎬WebCodecs
Weakly Supervised Concept Learning with Class-Level Priors for Interpretable Medical Diagnosis
arxiv.org·1d
🧠Machine Learning
Using ensemble learning with hybrid graph neural networks and transformers to predict traffic in cities
arxiv.org·6h
🧠Machine Learning
Show HN: Extrai – An open-source tool to fight LLM randomness in data extraction
github.com·1d
Discuss: Hacker News
📋Document Grammar
Un-Attributability: Computing Novelty From Retrieval & Semantic Similarity
arxiv.org·2d
🗂️Vector Databases