Locality Sensitive Hashing, Jaccard Similarity, Duplicate Detection, Document Clustering

Scoured 9590 posts in 1.13 s
Natural language processing for word sense disambiguation and information extraction
arxiv.org · 12h · Discuss: r/compsci
📥 Feed Aggregation
Performance Hints for BigQuery
trmlabs.com · 1h · Discuss: Hacker News
🚀 Query Optimization
What Deep Learning Theory Teaches Us About AI Memory
dev.to · 1d · Discuss: DEV
🧠 Learned Compression
Soft Filtering: Guiding Zero-shot Composed Image Retrieval with Prescriptive and Proscriptive Constraints
arxiv.org · 2d
🧮 Vector Embeddings
Document search using Claude and an inverted index.
annanay.dev · 4d · Discuss: Hacker News
🗂️ Vector Databases
Climate Monitoring Search Engine: Multi-Vectors in Qdrant
pub.towardsai.net · 4d
🗂️ Vector Search
Real Time Detection and Quantitative Analysis of Spurious Forgetting in Continual Learning
arxiv.org · 2d
💻 Local LLMs
Optimizing Text Search: A Novel Pattern Matching Algorithm Based on Ukkonen's Approach
arxiv.org · 5d
🌳 Trie Structures
MCAP Indexing - Monday Morning Haskell
mmhaskell.com · 5d
🔄 Burrows-Wheeler
Benchmarking and Enhancing VLM for Compressed Image Understanding
arxiv.org · 2d
🧠 Learned Compression
A data-directed approach to lexing.
lambda-the-ultimate.org · 3d
🧪 Binary Fuzzing
3 Smart Ways to Encode Categorical Features for Machine Learning - MachineLearningMastery.com
machinelearningmastery.com · 5d
🧠 Machine Learning
Community detection in networks: A user guide
dev.to · 2d · Discuss: DEV
🎯 Recommendation Metrics
CHAMMI-75: pre-training multi-channel models with heterogeneous microscopy images
arxiv.org · 2d
🧠 Machine Learning
MMSRARec: Summarization and Retrieval Augumented Sequential Recommendation Based on Multimodal Large Language Model
arxiv.org · 2d
🔍 Information Retrieval
Towards Ancient Plant Seed Classification: A Benchmark Dataset and Baseline Model
arxiv.org · 4d
🤖 Paleographic ML
Retrieval-Augmented Generation for Large Language Models: A Survey
paperium.net · 5d · Discuss: DEV
🌀 Brotli Internals
Error Localization, Certificates, and Hints for Probabilistic Program Verification via Slicing (Extended Version)
arxiv.org · 3d
📜 Proof Carrying Code
Reducing Label Dependency in Human Activity Recognition with Wearables: From Supervised Learning to Novel Weakly Self-Supervised Approaches
arxiv.org · 3d
📊 Learned Metrics