OBCache: Optimal Brain KV Cache Pruning for Efficient Long-Context LLM Inference
arxiv.org·22h
🧠LLM Inference
Understanding conflict resolution and avoidance in PostgreSQL: a complete guide
pgedge.com·6h·
Discuss: r/programming
🔄Eventual Consistency
How View Caching in Rails Works (2020)
honeybadger.io·12h·
Discuss: Hacker News
💾Prompt Caching
Looking at my Arduino
boswell.bearblog.dev·9h
🖥️Hardware Architecture
GPT-OSS from Scratch on AMD GPUs
reddit.com·4h·
Discuss: r/LocalLLaMA
🖥GPUs
Explicit Lossless Vertex Expanders!
gilkalai.wordpress.com·16h
🔬RaBitQ
Can AI Co-Design Distributed Systems? Scaling from 1 GPU to 1k
harvard-edge.github.io·4h·
Discuss: Hacker News
🌐Distributed systems
PostGIS Performance: Indexing and EXPLAIN
crunchydata.com·12h
🔍Query Optimization
A new method to build more energy-efficient memory devices could lead to a sustainable data future
phys.org·17h
🏭TSMC
QUIC! Jump to User Space!
hackaday.com·10h
QUIC Protocol
The DINOv3 Playbook for Computer Vision Data Science
pub.towardsai.net·13h
📊Vector Databases
(Forward) automatic implicit differentiation in Rust with num-dual 0.12.0
reddit.com·10h·
Discuss: r/rust
🎭Rust Macros
Slip – A Lisp System in JavaScript
lisperator.net·12h·
Discuss: Hacker News
💻Programming languages
How Do SSDs Work?
extremetech.com·14h·
Discuss: Hacker News
⚙️Mechanical Sympathy
Scaling Time-Series Data for AI Models
singlestore.com·11h
🎛️Feed Filtering
Proximity Lock System
producthunt.com·19h
💻CLI Tools
Progress being made in porting AMD OpenSIL Turin PoC to Coreboot in a Gigabyte MZ33-AR1
blog.3mdeb.com·6h
🖥GPUs
Fast and robust drift correction for single-molecule localization microscopy
nature.com·15h
🕯️Candle