LAVa: Layer-wise KV Cache Eviction with Dynamic Budget Allocation
arxiv.org·15h
📋JSON Parsing
Semantic Dictionary Encoding
falvotech.com·4h·
Discuss: Hacker News
🗂️Type Indexing
LLMs on a Shoestring: The Dynamic Cache Advantage by Arvind Sundararajan
dev.to·9h·
Discuss: DEV
💾Cache Algorithms
Language Models Pack Billions of Concepts into 12,000 Dimensions
nickyoder.com·15h·
🌱Minimal ML
Analog IMC Attention Mechanism For Fast And Energy-Efficient LLMs (FZJ, RWTH Aachen)
semiengineering.com·2h
Tokenizer Optimization
Conquering the LLM Memory Wall: How to Run 2–4x Longer Contexts with a Single Line of Code
reddit.com·7h·
Discuss: r/LocalLLaMA
🌪️V8 Pipeline
Show HN: Semlib – Semantic Data Processing
github.com·5h·
Discuss: Hacker News
🔍ML Language
Symmetric MultiProcessing, Hyper-Threading and scheduling on Maestro
blog.lenot.re·11h
Instruction Fusion
Think Different
hackster.io·5h
🐹Minimal Go
More hardware won’t fix bad engineering
infoworld.com·10h
🔮Branch Predictors
Spatial Transcriptomics Data Fusion via Multi-Modal Signal Disambiguation and Graph-Enhanced Integration
dev.to·1d·
Discuss: DEV
🔢Algebraic Datatypes
StringWa.rs on GPUs: Databases & Bioinformatics 🦠
ashvardanian.com·17m·
Discuss: r/programming
🚀Tokenizer Performance
Why some agentic AI developers are moving code from Python to Rust
developers.redhat.com·12h
Interpreter Optimization
[NodeBook] Understanding Buffers in Node.js - Why they exist, where they live in memory, and how they handle binary data
thenodebook.com·12h·
Discuss: r/node
🔢Binary Formats
Crashes are loud. Leaks are quiet.
blog.bitdrift.io·19h
🧠Memory Ordering
Spiking Networks: The Unexpected Shortcut to Smarter AI
dev.to·1h·
Discuss: DEV
🗺️Region Polymorphism
Introducing Segment Anything: Working toward the first foundation model for image segmentation
ai.facebook.com·2h
🧠Semantic Parsing
Speeding up my Ray Tracer using JAX
kayleegeorge.github.io·1h·
Discuss: Hacker News
🔍Lens Libraries
🥬 Freshness Checker AI: The AI-Powered Food Safety Assistant
freshness-checker-ai-460322848371.us-west1.run.app·14h·
Discuss: DEV
🌊Gradual Effects