🐿️ ScourBrowse
LoginSign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
📄 Semantic Chunking

Document Segmentation, Context Windows, Text Boundaries, Retrieval Units

JupyterLab-PKM 0.1.12
electricarchaeology.ca·10h
🌀Brotli Internals
Show HN: Requests-Based Google Maps Scraper
apify.com·8h·
Discuss: Hacker News
🔍BitFunnel
The modern text processing pipeline: Overview
newroadoldway.com·2d·
Discuss: Lobsters, r/programming
🔤Unicode Normalization
The collective waste caused by poor documentation
shanrauf.com·1d·
Discuss: Hacker News
📦Deflate
Launch HN: Reducto Studio (YC W24) – Build accurate document pipelines, fast
news.ycombinator.com·2d·
Discuss: Hacker News
🌀Brotli Internals
Conversational Intent-Driven GraphRAG: Enhancing Multi-Turn Dialogue Systems through Adaptive Dual-Retrieval of Flow Patterns and Context Semantics
arxiv.org·1d
🧮Prolog Parsing
QuranMorph: Morphologically Annotated Quranic Corpus
arxiv.org·2d
📋Document Grammar
OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling
arxiv.org·1h
🔲Cellular Automata
AMF-MedIT: An Efficient Align-Modulation-Fusion Framework for Medical Image-Tabular Data
arxiv.org·1d
🤖Advanced OCR
CLGRPO: Reasoning Ability Enhancement for Small VLMs
arxiv.org·2d
📏Linear Logic
CLIP-GS: CLIP-Informed Gaussian Splatting for View-Consistent 3D Indoor Semantic Understanding
arxiv.org·2d
📐Projective Geometry
Accurate and Energy Efficient: Local Retrieval-Augmented Generation Models Outperform Commercial Large Language Models in Medical Tasks
arxiv.org·1h
🌀Brotli Internals
Machine Learning Fundamentals: active learning project
dev.to·13h·
Discuss: DEV
🧠Machine Learning
Semantic Outlier Removal with Embedding Models and LLMs
arxiv.org·3d
🔍Information Retrieval
NaviAgent: Bilevel Planning on Tool Dependency Graphs for Function Calling
arxiv.org·1d
🔗Topological Sorting
Memory Safety in Web Rust System Zero Cost Secure(1750885516953300)
dev.to·7h·
Discuss: DEV
🦀Rust Borrowing
Which Vision Language Models Should You Use for Your Apps
thenewstack.io·2d
🤖Advanced OCR
Referring Expression Instance Retrieval and A Strong End-to-End Baseline
arxiv.org·2d
🔍Semantic Search
Machine Learning Fundamentals: active learning with python
dev.to·11h·
Discuss: DEV
🧠Machine Learning
Data Curation Matters: Model Collapse and Spurious Shift Performance Prediction from Training on Uncurated Text Embeddings
arxiv.org·2d
🗂️Vector Databases
Loading...Loading more...
AboutBlogChangelogRoadmap