Scour
📄 Text Chunking

Semantic Segmentation, Context Windows, Document Boundaries, Retrieval Units

Launch HN: Reducto Studio (YC W24) – Build accurate document pipelines, fast
news.ycombinator.com · 2d · Discuss: Hacker News
🌀 Brotli Internals
Detect Narrative Threats with AI Personas
askrally.com · 3d · Discuss: Hacker News
📡 Feed Archaeology
ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs
arxiv.org · 2d
💻 Local LLMs
LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning
arxiv.org · 2d
Concrete Syntax
Generalizing Vision-Language Models to Novel Domains: A Comprehensive Survey
arxiv.org · 2d
🤖 Advanced OCR
🧠 We Save Links, But We Don't Save Knowledge - Why I'm Rethinking Web Reading
dev.to · 19h · Discuss: DEV
🔗 Online Curation
The Engineering Tradeoffs Behind HNSW-Based Vector Search
dev.to · 2h · Discuss: DEV
🗂️ Vector Databases
Semantic similarity estimation for domain specific data using BERT and other techniques
arxiv.org · 2d
Semantic Search
QuranMorph: Morphologically Annotated Quranic Corpus
arxiv.org · 2d
📋 Document Grammar
Granular-Ball-Induced Multiple Kernel K-Means
arxiv.org · 2d
🌀 Differential Geometry
Scalable Machine Learning Algorithms using Path Signatures
arxiv.org · 2d
🧠 Machine Learning
Computational Approaches to Understanding Large Language Model Impact on Writing and Information Ecosystems
arxiv.org · 2d
📜 Digital Philology
AMF-MedIT: An Efficient Align-Modulation-Fusion Framework for Medical Image-Tabular Data
arxiv.org · 1d
🤖 Advanced OCR
When Data Becomes a Bottleneck: Why Smart People Still Struggle to Get Answers
dev.to · 21h · Discuss: DEV
🌊 Stream Processing
A Complete Guide to Retrieval-Augmented Generation
dev.to · 3d · Discuss: DEV
🌀 Brotli Internals
Machine Learning Fundamentals: accuracy with python
dev.to · 1d · Discuss: DEV
Observatory Systems
Machine Learning Fundamentals: active learning
dev.to · 1d · Discuss: DEV
🤖 Grammar Induction
Tackling Data Heterogeneity in Federated Learning through Knowledge Distillation with Inequitable Aggregation
arxiv.org · 4h
🧠 Machine Learning
Probing AI Safety with Source Code
arxiv.org · 4h
✨ Effect Handlers
Kafka Fundamentals: kafka retention.ms
dev.to · 22h · Discuss: DEV
🌊 Streaming Systems