📄 Text Chunking - matmat · Scour

perrotta.dev·3d

📄Document Streaming

A Ralph Loop for Reading: Beating GPT 5.2 with a 4k Context Window (and 4 GPUs)

stevehanov.ca·5d

🌊Streaming Algorithms

Document Reconstruction Unlocks Scalable Long-Context RLVR

arxiv.org·1d

🤖Manuscript AI

MeDocVL: A Visual Language Model for Medical Document Understanding and Parsing

arxiv.org·2d

📝Document Chunking

q3m – A what3words-like geocoding library for France, using 3 French words

reddit.com·3d·

Discuss: r/golang

🔶Voronoi Diagrams

CNN-based Segmentation of Medical Imaging Data

dev.to·3d·

Discuss: DEV

🌀Riemannian Computing

The UI: Why It's the Real AI Agent Bottleneck

hackernoon.com·2d

📐Proof Assistants

Vector Databases Explained: Architecture and System Design for AI Apps

dev.to·2d·

Discuss: DEV

🗂️Vector Databases

I kept highlighting on Kindle and never reusing them, so I built a small tool

litmarks.ai·2d·

Discuss: Hacker News

Show HN: Fine-tuned Qwen2.5-7B on 100 films for probabilistic story graphs

cinegraphs.ai·3d·

Discuss: Hacker News

🕸️Knowledge Graphs

Transform Books into Interactive Courses

book2course.org·3d·

Discuss: Hacker News

🏛Digital humanities

Show HN: Seedream 5.0: free AI image generator that claims strong text rendering

seedream5ai.org·3d·

Discuss: Hacker News

🌊Streaming Algorithms

Show HN: Distill – AI summaries and Worth It scores for YouTube videos

chromewebstore.google.com·1d·

Discuss: Hacker News

🗜️LZW Variants

Building a Zero-Allocation, SIMD-Accelerated CSV Parser in Zig

peymanmo.com·1d·

Discuss: Hacker News

🚀SIMD Text Processing

Webspace Invaders

matthiasott.com·2d·

Discuss: Lobsters, Hacker News

🌐WARC Forensics

Show HN: Readability API

unrender.page·3d·

Discuss: Hacker News, r/SideProject

👁️Constructive OCR

EFTA00400459 has been cracked, DBC12.pdf liberated

neosmart.net·3d·

Discuss: Hacker News

👁️Constructive OCR

Technical Details of My LLM-Generated Book

mattbruenig.com·1d·

Discuss: Hacker News

📝Concrete Syntax

Your Agent’s Memory Is Broken. Here’s Why.

ramsriharsha.substack.com·4d·

Discuss: Substack

🗄️Database Internals

Achieving Ultra-Fast AI Chat Widgets

cjroth.com·3d·

Discuss: Hacker News

Loading more...