๐Ÿฟ๏ธ ScourBrowse
LoginSign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
๐Ÿ“„ Semantic Chunking

Document Segmentation, Context Windows, Text Boundaries, Retrieval Units

Enhancing Document VQA Models via Retrieval-Augmented Generation
arxiv.orgยท10h
๐Ÿ“„Text Chunking
Show HN: Vectorless RAG
github.comยท5hยท
Discuss: Hacker News
๐Ÿ“ŠMulti-vector RAG
Comparing Six Deep Learning Feature Extractors for CBIR Tasks
hackernoon.comยท7h
๐Ÿ“ŠLearned Metrics
How to Summarize Huge Documents with LLMs: Beyond Token Limits and Basic Prompts
dev.toยท19hยท
Discuss: DEV
๐Ÿ“Text Compression
Why Stacking Sliding Windows Can't See Far
guangxuanx.comยท9hยท
Discuss: Hacker News
๐ŸงฎKolmogorov Complexity
Spontaneous (dis)fluency
languagelog.ldc.upenn.eduยท1h
๐Ÿ›Digital humanities
How AI Retrieves Anatomical Structures Using Vector Databases
hackernoon.comยท5h
๐Ÿ—‚๏ธVector Search
OCR Is Legacy Tech
cloudsquid.ioยท19hยท
Discuss: Hacker News
๐Ÿ‘๏ธOCR Evolution
Learning ON Large Datasets Using Bit-String Trees
arxiv.orgยท1d
๐Ÿ—‚๏ธVector Databases
Googleโ€™s URL Context Grounding: Another Nail in RAGโ€™s Coffin?
towardsdatascience.comยท1d
๐ŸŒ€Brotli Internals
Facts, Arguments, Theses: Building AI Knowledge Retrieval on Meaning, Not Slices
nsavage.substack.comยท4dยท
Discuss: Substack
๐Ÿ“„Text Chunking
An epiphany about bloated web pages might be the result of a dumb network (2023)
boston.conman.orgยท17hยท
Discuss: Hacker News
๐Ÿš€Indie Hacking
Why Iโ€™m Against Claude Codeโ€™s Grep-Only Retrieval? It Just Burns Too Many Tokens
milvus.ioยท2dยท
Discuss: Hacker News
๐ŸŒณIncremental Parsing
Drama Model Inference Efficiency Boosted by 1.7x-2.3x
pytorch.orgยท14hยท
Discuss: Hacker News
๐Ÿš€SIMD Text Processing
Understanding Tool-Integrated Reasoning
arxiv.orgยท10h
๐Ÿ”—Constraint Handling
WoW: A Window-to-Window Incremental Index for Range-Filtering Approximate Nearest Neighbor Search
arxiv.orgยท10h
๐Ÿ—‚๏ธVector Databases
Semantic Embedding in RAG: why close vectors still miss meaning and how to fix it
dev.toยท1dยท
Discuss: DEV
๐ŸงฎVector Embeddings
The Impact of Visual Segmentation on Lexical Word Recognition
arxiv.orgยท1d
๐Ÿ“„OCR
An Efficient Dual-Line Decoder Network with Multi-Scale Convolutional Attention for Multi-organ Segmentation
arxiv.orgยท1d
๐Ÿ“ŠLearned Metrics
Making MCP Tool Use Feel Natural with Context-Aware Tools
ragie.aiยท20hยท
Discuss: Hacker News
๐Ÿ”—Constraint Handling
Loading...Loading more...
AboutBlogChangelogRoadmap