LAVa: Layer-wise KV Cache Eviction with Dynamic Budget Allocation
arxiv.org·17h
📋JSON Parsing
Semantic Dictionary Encoding
falvotech.com·6h·Discuss: Hacker News
🗂️Type Indexing
Rapid Assessment of Perovskite Degradation Using Hyperspectral Imaging & Machine Learning
dev.to·57m·Discuss: DEV
Effect Inference
LLMs on a Shoestring: The Dynamic Cache Advantage by Arvind Sundararajan
dev.to·11h·Discuss: DEV
💾Cache Algorithms
Language Models Pack Billions of Concepts into 12,000 Dimensions
nickyoder.com·17h
🌱Minimal ML
Analog IMC Attention Mechanism For Fast And Energy-Efficient LLMs (FZJ, RWTH Aachen)
semiengineering.com·5h
Tokenizer Optimization
Conquering the LLM Memory Wall: How to Run 2–4x Longer Contexts with a Single Line of Code
reddit.com·9h·Discuss: r/LocalLLaMA
🌪️V8 Pipeline
Show HN: Semlib – Semantic Data Processing
github.com·7h·Discuss: Hacker News
🔍ML Language
Symmetric MultiProcessing, Hyper-Threading and scheduling on Maestro
blog.lenot.re·13h
Instruction Fusion
A Visual Guide to Tuning Gradient Boosted Trees
towardsdatascience.com·2h
🪜Recursive Descent
Think Different
hackster.io·7h
🐹Minimal Go
More hardware won’t fix bad engineering
infoworld.com·12h
🔮Branch Predictors
StringWa.rs on GPUs: Databases & Bioinformatics 🦠
ashvardanian.com·2h
🚀Tokenizer Performance
Why some agentic AI developers are moving code from Python to Rust
developers.redhat.com·14h
Interpreter Optimization
[NodeBook] Understanding Buffers in Node.js - Why they exist, where they live in memory, and how they handle binary data
thenodebook.com·14h·Discuss: r/node
🔢Binary Formats
Spiking Networks: The Unexpected Shortcut to Smarter AI
dev.to·3h·Discuss: DEV
🗺️Region Polymorphism
Crashes are loud. Leaks are quiet.
blog.bitdrift.io·21h
🧠Memory Ordering
Introducing Segment Anything: Working toward the first foundation model for image segmentation
ai.facebook.com·4h
🧠Semantic Parsing
Verlog: A Multi-turn RL framework for LLM agents
blog.ml.cmu.edu·6h
🎭Erlang OTP