Boosting LLM Performance with Tiered KV Cache on Google Kubernetes Engine
cloud.google.com·10h
📈Machine Learning
Flag this post
Designing smarter caches with Valkey 9.0's numbered databases
dev.to·5h·
Discuss: DEV
👩‍💻Programming
Flag this post
Mount Mayhem at Netflix: Scaling Containers on Modern CPUs
netflixtechblog.medium.com·2h·
Discuss: Hacker News
👩‍💻Programming
Flag this post
Pool allocator in C++23 for simulations / game engines - faster than std::pmr
github.com·1d·
Discuss: r/programming
🧮Algorithms
Flag this post
Memory Matters: The State of Embedded NVM (eNVM) 2025
semiwiki.com·1d
🧮Algorithms
Flag this post
Attention Is All You Need for KV Cache in Diffusion LLMs
paperium.net·3d·
Discuss: DEV
🧮Algorithms
Flag this post
Which Chip Is Best?
blog.confident.security·1d·
Discuss: Hacker News
🧮Algorithms
Flag this post
One Size Does Not Fit All: Architecture-Aware Adaptive Batch Scheduling with DEBA
arxiv.org·17h
🧮Algorithms
Flag this post
MySQL Memory Management and Replication Best Practices for High-Load Environments
dev.to·8h·
Discuss: DEV
🗄️Databases
Flag this post
Supercharging Real-Time Applications with TiDB and DragonflyDB
pingcap.com·3h
👩‍💻Programming
Flag this post
A Shallow Introduction to Queueing Theory
thefridaydeploy.substack.com·12h·
Discuss: Substack
🧮Algorithms
Flag this post
🏗️ Hardware Memory bandwidth is becoming the choke point slowing down GenAI.
threadreaderapp.com·2d
🧮Algorithms
Flag this post
My works run slower on pre-production than in test – why?
colinpaice.blog·10h
Performance
Flag this post
Non-recursively deleting a binary tree in constant space: Restructuring the tree
devblogs.microsoft.com·7h·
Discuss: Hacker News
🧮Algorithms
Flag this post
Progress Update 1.22 - Optimising the Engine 🛠️
fallahn.itch.io·1d
🧮Algorithms
Flag this post
Anukari on the CPU (part 2: CPU optimization)
anukari.com·2h·
Discuss: Hacker News
👩‍💻Programming
Flag this post
Inside Pinecone: Slab Architecture
pinecone.io·3d·
Discuss: Hacker News
🧮Algorithms
Flag this post
SOLIDWORKS PDM: Best Practices for Cache Management
cad-store.net·2h·
Discuss: DEV
👩‍💻Programming
Flag this post
13 Arguments About a Transition to Neuralese AIs
lesswrong.com·6h
🤖AI
Flag this post