Semantic Segmentation, Context Windows, Document Boundaries, Retrieval Units

A Systematic Analysis of Chunking Strategies for Reliable Question Answering
arxiv.org·1d
📄Semantic Chunking
GutenOCR: A Grounded Vision-Language Front-End for Documents
arxiv.org·48m
🤖Advanced OCR
Exploring Text Compression
denvaar.dev·1d
📝Text Compression
Inside Mixedbread: How We Built Multimodal Late-Interaction at Billion Scale
mixedbread.com·2d·
Discuss: Hacker News
🗂️Vector Search
Building an Intelligent Web Document Scanner with OCR and Chrome's Built-in AI
dev.to·3h·
Discuss: DEV
📄Document Streaming
Everything Open 2026 – Day 2
blog.darkmere.gen.nz·6h
🔍Archive Search
Redacting Faces, People, Vehicles, and Plates with Amped Replay Assisted Redaction
blog.ampedsoftware.com·14h
🧪Archive Fuzzing
You Probably Don’t Need a Vector Database for Your RAG — Yet
towardsdatascience.com·1d
🗂️Vector Databases
Introducing multimodal retrieval for Amazon Bedrock Knowledge Bases
aws.amazon.com·1d
🌀Brotli Dictionary
Databases are magic ... until ...
silvestreperret.com·20h·
Discuss: Hacker News
🗄️Database Internals
From Retrieval to Relevance: 5 Reranker Types Defining Modern Search Systems
pub.towardsai.net·1d
🎯Retrieval Systems
The Silent AI Breach: How Data Escapes in Fragments
hackernoon.com·9h
🔓Hacking
Patterns All the Way Down: A Generalization for Graph-Like Things
medium.com·13h·
Discuss: Hacker News
🤝Unification Algorithms
Show HN: We built an OCR API to stop babysitting extraction pipelines
news.ycombinator.com·1d·
Discuss: Hacker News
👁️Constructive OCR
The Document Data Crisis
dev.to·11h·
Discuss: DEV
🤖Archive Automation
featurestorebook/mlfs-book: O'Reilly book - Building Machine Learning Systems with a feature store: batch, real-time, and LLMs
github.com·4h·
Discuss: Hacker News
🧠Machine Learning
Explainer: Tree-sitter vs. LSP
lambdaland.org·1d
🌳Context free grammars
Everything Moe
ianbarber.blog·1d·
Discuss: Hacker News
🧠Learned Compression
Verdent : AI Coding with Parallel Agents
julsimon.medium.com·2d
⚔️Lean Tactics
The 2026 String Similarity Guidelines: Automating the Mistakes of the Past
circleid.com·1d
Format Verification
