My First Multi-GPU Kernel: Writing All-to-All for AMD MI300X
gau-nernst.github.io·1d·
Discuss: Hacker News
Performance Optimization
Flag this post
Free Functions Don't Change Performance (Much)
16bpp.net·1d·
Discuss: Hacker News, r/cpp
🚀Performance
Flag this post
Microservices? No, modularity is what matters
binaryigor.com·7h·
Discuss: Hacker News
🏗️Build Systems
Flag this post
Inside Pinecone: Slab Architecture
pinecone.io·3h·
Discuss: Hacker News
🚀Performance
Flag this post
Show HN: Polyglot Docker dev environment setup – C/C++/Rust/Python
github.com·15h·
Discuss: Hacker News
🏗️Build Systems
Flag this post
Advice for getting into programming of hardware
reddit.com·16h·
Discuss: r/hardware
📟Embedded Systems
Flag this post
Autark: Rethinking build systems – Integrate, Don't Outsource
blog.annapurna.cc·4h·
Discuss: Hacker News
🏗️Build Systems
Flag this post
Lazy loading isn't the magic pill to fix AI Inference
tensorfuse-docs.mintlify.dev·5h·
Discuss: Hacker News
🚀Performance
Flag this post
Linux/WASM
joelseverin.github.io·2d·
🧠Memory Management
Flag this post
Attention Is All You Need for KV Cache in Diffusion LLMs
paperium.net·15h·
Discuss: DEV
Performance Optimization
Flag this post
How to Design Efficient Memory Architectures for Agentic AI Systems
pub.towardsai.net·1h
🧠Memory Management
Flag this post
Machine Scheduler in LLVM – Part II
myhsu.xyz·2d·
Performance Optimization
Flag this post
Showcase: In Memoria - Rust core with TypeScript/NAPI interface for high-performance AI tooling
reddit.com·4h·
Discuss: r/rust
⚙️Compilers
Flag this post
Cons Should Not Cons Its Arguments, Part II: Cheney on the MTA
web.archive.org·1d·
Discuss: Hacker News
⚙️Compilers
Flag this post
Computer Science Fundamentals: From Binary Systems to Algorithms
dev.to·4h·
Discuss: DEV
🔀Parallel Computing
Flag this post
Defeating KASLR by Doing Nothing at All
googleprojectzero.blogspot.com·1d·
🧠Memory Management
Flag this post
Linux 6.19 To Optimize Exiting To User-Space For Restartable Sequences
phoronix.com·21h
Performance Optimization
Flag this post
The Microsoft SoftCard for the Apple II: Getting two processors to share the same memory
devblogs.microsoft.com·5h
🧠Memory Management
Flag this post
PyTorch Team Introduces Cluster Programming
i-programmer.info·2h
🔀Parallel Computing
Flag this post
Exploring a space-based, scalable AI infrastructure system design
research.google·3h·
Discuss: Hacker News
📟Embedded Systems
Flag this post