LAVa: Layer-wise KV Cache Eviction with Dynamic Budget Allocation
arxiv.orgยท22h
๐Ÿ’ปLocal LLMs
Iron Vector: 50% Cost Reduction for Apache Flink Workloads
irontools.devยท6hยท
Discuss: Hacker News
๐ŸŒŠStreaming Systems
Genkit Go 1.0: Google brings stable AI framework to the Go ecosystem
heise.deยท8h
๐Ÿ›๏ธAgda
Cap'n Proto - structured data serialziation format
capnproto.orgยท8h
๐Ÿ“‹Protocol Buffers
Semantic Dictionary Encoding
falvotech.comยท11hยท
Discuss: Hacker News
๐ŸŒ€Brotli Dictionary
Learn How to Use Transformers with HuggingFace and SpaCy
towardsdatascience.comยท12h
๐ŸŽฏDependent Parsing
Testing Compression with a Bash Script
gilesorr.comยท1d
๐Ÿ“ฆDeflate
In-depth Review of Emacs tree-sitter integration
archive.casouri.ccยท3hยท
Discuss: Lobsters
๐ŸŒณIncremental Parsing
Lots of people asked about power usage on my post from yesterday, so I overnighted a wattage meter.
reddit.comยท9hยท
Discuss: r/homelab
๐Ÿ“ŠHomelab Monitoring
The many, many, many JavaScript runtimes of the last decade
shapeof.comยท8h
๐Ÿ—๏ธCompiler Archaeology
Show HN: Semlib โ€“ Semantic Data Processing
github.comยท12hยท
Discuss: Hacker News
๐ŸŒณIncremental Parsing
Automated Data Lineage Reconstruction via Multi-Modal Graph Analysis & HyperScore Validation
dev.toยท7hยท
Discuss: DEV
๐Ÿ”—Data Provenance
Why you should care about the JDBC fetch size
in.relation.toยท14hยท
Discuss: r/programming
๐ŸŒŠStreaming Databases
I Ran Local LLMs on My Android Phone
itsfoss.comยท14h
๐Ÿ’ปLocal LLMs
Fighting human trafficking with self-contained applications
lwn.netยท2hยท
Discuss: Hacker News
๐Ÿ”ฉSystems Programming
LLM Rerankers for RAG: A Practical Guide
fin.aiยท1dยท
๐Ÿ”Information Retrieval
Text-to-SQL Oriented to the Process Mining Domain: A PT-EN Dataset for Query Translation
arxiv.orgยท22h
๐Ÿ“‹Document Grammar
Language Models Pack Billions of Concepts into 12,000 Dimensions
nickyoder.comยท22hยท
๐ŸงฎKolmogorov Complexity
LLMs on a Shoestring: The Dynamic Cache Advantage by Arvind Sundararajan
dev.toยท16hยท
Discuss: DEV
๐Ÿ’จCache Optimization
Two Axes, Four Patterns: How Teams Actually Do GPU Binpack/Spread on K8s (w/ DRA context)
reddit.comยท12hยท
Discuss: r/kubernetes
๐Ÿ”ŒOperating system internals