๐Ÿฟ๏ธ ScourBrowse
LoginSign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
๐Ÿ“ Text Compression

Huffman Coding, LZ77, Burrows-Wheeler, Dictionary Compression, Grammar-Based

An unfinished post: "Compressing short Unicode strings with BOCU-1"
evanhahn.comยท1d
๐Ÿ”คCharacter Encoding
Structured Output for Beginners and 3 Prompting Tips
pocketflow.substack.comยท7hยท
Discuss: Substack
๐ŸŒณIncremental Parsing
Privacy-Shielded Image Compression: Defending Against Exploitation from Vision-Language Pretrained Models
arxiv.orgยท1d
๐Ÿง Learned Compression
A Gentle Introduction to Multi-Head Attention and Grouped-Query Attention
machinelearningmastery.comยท11h
๐Ÿš€SIMD Text Processing
Enhancing Content Diversity with NLP-Based Clustering
hackernoon.comยท15h
๐Ÿ“šDocument Clustering
Recent optimizations on integer to string conversions
reddit.comยท16hยท
Discuss: r/rust
๐ŸงฎAlgebraic Datatypes
Linters, Formatters, and Type-Checkers
playfulprogramming.comยท1d
๐ŸŽฏGradual Typing
graven-image: Portability library for CL image in REPL
github.comยท13hยท
Discuss: Lobsters
๐Ÿฆ€Rust Macros
4 o4-mini-high Prompts saved me $100-200/yr and yet another SaaS app
suthakamal.substack.comยท12hยท
Discuss: Substack
๐Ÿ“šLempel-Ziv
# Is 100% AI-Assisted Software Development Possible? โ€“ A Real Experience
dev.toยท5hยท
Discuss: DEV
๐Ÿ“ฆDeflate
Thunder-Tok: Minimizing Tokens per Word in Tokenizing Korean Texts for Generative Language Models
arxiv.orgยท1d
๐Ÿ“Text Parsing
What I learned from the book Designing Data-Intensive Applications
newsletter.techworld-with-milan.comยท16hยท
Discuss: r/compsci, r/programming
๐Ÿ—„๏ธDatabase Internals
Amped FIVE Update 37757: Writing Queue, Camera Calibration, RIFF Viewer, Timing Source for Video Writer and Much More
blog.ampedsoftware.comยท18h
โฑ๏ธSMPTE Timecode
Java, What's Old? Part I: Collections
foojay.ioยท16hยท
Discuss: Hacker News
๐Ÿ”ขBinary Formats
Finished checking and fixing up the diagnostic load file
rescue1130.blogspot.comยท20hยท
Discuss: rescue1130.blogspot.com
๐Ÿ”คCharacter Encoding
Open-source 3B param model better than Mistral OCR
huggingface.coยท4dยท
Discuss: Hacker News
๐Ÿค–Advanced OCR
Evaluating Google Gemini for Document OCR Using Hugging Face Invoice Dataset
dev.toยท17hยท
Discuss: DEV
๐Ÿ“„OCR
Kyutai STT โ€“ A speech-to-text optimized for real-time usage
kyutai.orgยท10hยท
Discuss: Hacker News
๐ŸŽ™๏ธWhisper
Will long context windows solve all your problems?
frontierai.substack.comยท13hยท
Discuss: Substack
๐Ÿ’ปLocal LLMs
Meeting summarization and action item extraction with Amazon Nova
aws.amazon.comยท1d
๐Ÿ“ŠFeed Optimization
Loading...Loading more...
AboutBlogChangelogRoadmap