LAVa: Layer-wise KV Cache Eviction with Dynamic Budget Allocation
arxiv.org·5h
💻Local LLMs
Disaggregated Inference at Scale with PyTorch and VLLM
pytorch.org·1d·
Discuss: Hacker News
LZ4 Streaming
Enhanced Continuous-Time Signal Classification with Adaptive Wavelet Scattering Networks
dev.to·23h·
Discuss: DEV
📊Quantization
Show HN: FSP2 Tested on excerpt "Romeo and Juliet" impressive compresion results
news.ycombinator.com·3d·
Discuss: Hacker News
📝Text Compression
A Kevin week
blog.mitrichev.ch·12h·
📐Linear Algebra
OTW - Bandit Level 4 to Level 5
tbhaxor.com·4h
🔧KAITAI
Language Models Pack Billions of Concepts into 12,000 Dimensions
nickyoder.com·5h·
Discuss: Hacker News
🧮Kolmogorov Complexity
It actually is a snap?
lambdacreate.com·21h
❄️Nix Flakes
Package for http Response buffering
reddit.com·1h·
Discuss: r/golang
gRPC
GStreamer 1.26.6 Brings Fixes for Spotify, Vulkan, and V4L2
linuxiac.com·18h
🎵Audio Streaming
LLM Rerankers for RAG: A Practical Guide
fin.ai·12h·
Discuss: Hacker News
🔍Information Retrieval
Python Multiprocessing: Start Methods, Pools, and Communication
dev.to·4h·
Discuss: DEV
🌊Stream Processing
CoDiCodec: Unifying Continuous and Discrete Compressed Representations of Audio
arxiv.org·5h
🎧Learned Audio
Topological Sort: Managing Mutable Structures in Haskell
mmhaskell.com·1h
🔗Topological Sorting
Hyperdimensional Prime Editing Optimization: Predictive Modeling for Cystic Fibrosis Gene Correction
dev.to·8h·
Discuss: DEV
🧬Copy Number Variants
Cognitive and Gestalt psychology in your code: SMVP pattern
github.com·10h·
Discuss: Hacker News
Format Verification
The future of microoptimization
goldenstack.net·2d·
Discuss: Hacker News
🧮Compute Optimization
Automated Batch Process Optimization via Dynamic Reinforcement Learning and Hyperdimensional Data Fusion
dev.to·1d·
Discuss: DEV
⚙️Batch Processing
a few notes on ratelimiting
dotat.at·1d·
FLAC Verification
Weighted random generation in Python (2010)
eli.thegreenplace.net·12h·
Discuss: Hacker News
🔢Bitwise Algorithms