LAVa: Layer-wise KV Cache Eviction with Dynamic Budget Allocation
arxiv.org·54m
💻Local LLMs
A Kevin week
blog.mitrichev.ch·7h·
📐Linear Algebra
Language Models Pack Billions of Concepts into 12,000 Dimensions
nickyoder.com·1h·
Discuss: Hacker News
🧮Kolmogorov Complexity
📊Beyond the Standard: Exploring Modern Python Visualization Tools
dev.to·1d·
Discuss: DEV
Bidirectional Programming
On training binary neural networks
kevinmartinjose.com·9h
📊Quantization
A Dumb Introduction to z3. Exploring the world of constraint solvers with very simple examples.
asibahi.github.io·7h·
🧮Z3 Solver
Planarizing matchings
11011110.github.io·10h
🎨Graph Coloring
Disaggregated Inference at Scale with PyTorch and VLLM
pytorch.org·1d·
Discuss: Hacker News
LZ4 Streaming
Weighted random generation in Python (2010)
eli.thegreenplace.net·7h·
Discuss: Hacker News
🔢Bitwise Algorithms
LLM Rerankers for RAG: A Practical Guide
fin.ai·7h·
Discuss: Hacker News
🔍Information Retrieval
LangChain, LangGraph, and LangSmith: Untangling the Confusion
dev.to·4h·
Discuss: DEV
Effect Handlers
Things to build with Google's new Nano Banana image editing and generation model
logankilpatrick.medium.com·13h·
Discuss: Hacker News
Homebrew CPUs
The future of microoptimization
goldenstack.net·2d·
Discuss: Hacker News
🧮Compute Optimization
Cognitive and Gestalt psychology in your code: SMVP pattern
github.com·5h·
Discuss: Hacker News
Format Verification
I built an LLM from Scratch in Rust (Just ndarray and rand)
reddit.com·13h·
Discuss: r/rust
🦀Rust Borrowing
Building High-Performance Caching in Go: A Practical Guide
dev.to·3h·
Discuss: DEV
💨Cache Optimization
Creativity Benchmark: A benchmark for marketing creativity for LLM models
arxiv.org·54m
🧠Intelligence Compression
Review: SpikingBrain Technical Spiking Brain-Inspired Large Models
arxiviq.substack.com·1d·
Discuss: Substack
CPU Microarchitecture
how fast is go? simulating millions of particles on a smart tv
dgerrells.com·1d·
🖥️Game Emulation
Solving LeetCode's "Add Two Numbers" Iteratively and Recursively - Part 1
dev.to·15h·
Discuss: DEV
🔗Topological Sorting