No Free Lunch: Deconstruct Efficient Attention with MiniMax M2
lmsys.org·1d
📱Edge AI
An introduction to program synthesis (Part II) - Automatically generating features for machine learning
🎭Program Synthesis
Identifying Linux Kernel Instability Due to Poor RCU Synchronization
arxiv.org·1d
🔒Futex
How Datadog Built a Custom Database to Ingest Billions of Metrics Per Second
blog.bytebytego.com·23h
🗄️Database Engines
Geonum – geometric number library for unlimited dimensions with O(1) complexity
📏Linear Types
If your app > 100KB, delete your GitHub
🌳Git
Low-Level Hacks
🦀Rust
Crushing ML Latency: The (Un)Official Best Practices for Systems Optimisation
pub.towardsai.net·9h
🚀Performance
Zensical – A modern static site generator built by the Material for MkDocs team
🗂️Obsidian
TIL: For long-lived LLM sessions, swapping KV Cache to RAM is ~10x faster than recalculating it. Why isn't this a standard feature?
🏗️NUMA
Writing a DOS Clone in 2019
🖥️Emulation
MemSearcher: Training LLMs to Reason, Search and Manage Memory via End-to-End Reinforcement Learning
arxiv.org·10h
💬Prompt Engineering
Why stop at 1M tokens when you can have 10M?
🚀Performance