Attention Is All You Need for KV Cache in Diffusion LLMs
paperium.net·15h·
Discuss: DEV
💾Cache Design
Myths Programmers Believe about CPU Caches
software.rajivprab.com·4d·
Discuss: Hacker News
⚙️Systems Programming
Predicting & Mitigating Data Corruption in Pure Storage Flash Arrays via Adaptive Bit Error Rate Modeling
dev.to·9h·
Discuss: DEV
🔌Embedded Systems
Moving past speculation: How deterministic CPUs deliver predictable AI performance
venturebeat.com·2d
SIMD
How to Design Efficient Memory Architectures for Agentic AI Systems
pub.towardsai.net·1h
🛡️Memory Safety
Inside Pinecone: Slab Architecture
pinecone.io·3h·
Discuss: Hacker News
🗄️Database Internals
Cutting LLM Batch Inference Time in Half: Dynamic Prefix Bucketing at Scale
daft.ai·2h·
Discuss: Hacker News
🔢algo
Don't let these 3 CPU specs trick you into paying more
xda-developers.com·1d
Performance Engineering
Showcase: In Memoria - Rust core with TypeScript/NAPI interface for high-performance AI tooling
reddit.com·4h·
Discuss: r/rust
🕸️WebAssembly
On Designing Low-Latency Systems for High-Traffic Environments
hackernoon.com·1d
⚖️Load Balancing
My First Multi-GPU Kernel: Writing All-to-All for AMD MI300X
gau-nernst.github.io·1d·
Discuss: Hacker News
🗑️Garbage Collection
Low-Level Hacks
blog.raycursive.com·17h·
Discuss: Hacker News
⚙️Systems Programming
Co-Simulation Framework for Parallel DNN Execution on Chiplet-Based Systems (UW–Madison, Washington State)
semiengineering.com·23h
🔌Embedded Systems
Reliability assessment of multi-performance system incorporating multiple common buses and transformation devices
sciencedirect.com·4h
🔌Embedded Systems
Show HN: Polyglot standard library HTTP client C/C++/Rust/Python and benchmarks
github.com·15h·
Discuss: Hacker News
🔨Compiler Design
Benchmarking the cost of Java's EnumSet - A Second Look
kinnen.de·43m·
Discuss: r/programming
#️⃣Hash Tables
Parallel achieves 70% accuracy on SEAL, benchmark for hard web research
parallel.ai·56m·
Discuss: Hacker News
Performance Engineering
Detailed Technical Documentation on AI Implementation Logic (Taking Large Language Models as an Example)
nbtab.com·11h·
Discuss: DEV
📝Parsing
Running MiniMax-M2 locally - Existing Hardware Advice
reddit.com·3h·
Discuss: r/LocalLLaMA
⚙️Systems Programming