SIMD, Vector Instructions, CPU Optimization, Performance

My First Multi-GPU Kernel: Writing All-to-All for AMD MI300X
gau-nernst.github.io·16h·
Discuss: Hacker News
Model Efficiency
Flag this post
Why Multimodal AI Broke the Data Pipeline — And How Daft Is Beating Ray and Spark to Fix It
hackernoon.com·12h
Model Efficiency
Flag this post
Geonum – geometric number library for unlimited dimensions with O(1) complexity
github.com·3h·
Discuss: Hacker News
Model Efficiency
Flag this post
Free Functions Don't Change Performance (Much)
16bpp.net·3h·
Discuss: Hacker News, r/cpp
Model Efficiency
Flag this post
Essential Things to Know Before Upgrading Your Computer Memory
buysellram.com·59m·
Discuss: Hacker News
Model Efficiency
Flag this post
Cure - Verification-First Programming for the BEAM
cure-lang.org·6h·
Discuss: Lobsters
Model Efficiency
Flag this post
The next RISC-V processor frontier: AI
edn.com·3d·
Discuss: Hacker News
Model Efficiency
Flag this post
How KVM and QEMU run VMs in Linux
popovicu.com·1d·
Discuss: r/linux
Model Efficiency
Flag this post
Vectorizing for Fun and Performance
ibm.com·5d·
Discuss: Hacker News
Model Efficiency
Flag this post
How to get the GOT address from a PLT stub using GDB
rafaelbeirigo.github.io·1d·
Discuss: Hacker News
💻Tech
Flag this post
Can-t stop till you get enough
cant.bearblog.dev·23h·
Discuss: Hacker News
🤖AI
Flag this post
CHERIoT 1.0 Released
cheriot.org·42m·
Discuss: Hacker News
Model Efficiency
Flag this post
Kimi Linear: An Expressive, Efficient Attention Architecture
arxiviq.substack.com·1d·
Discuss: Substack
LLM Optimization
Flag this post
MobileNetV3 Paper Walkthrough: The Tiny Giant Getting Even Smarter
towardsdatascience.com·1d
Model Efficiency
Flag this post
A hitchhiker's guide to CUDA programming
seanzhang.me·3d·
Discuss: Hacker News
Model Efficiency
Flag this post
ClipTagger-12B VLM: Frame Captioning Tutorial
dev.to·1d·
Discuss: DEV
LLM Optimization
Flag this post
We found embedding indexing bottleneck in the least expected place: JSON parsing
nixiesearch.substack.com·55m·
Discuss: Substack
LLM Optimization
Flag this post
Cycle-accurate 6502 emulator as coroutine in Rust
github.com·2d·
✍️Prompt Engineering
Flag this post
Show HN: GPU-accelerated sandboxes for running AI coding agents in parallel [video]
youtube.com·3d·
Discuss: Hacker News
🤖AI
Flag this post
Spiking Neural Networks: The Future of Brain-Inspired Computing
arxiv.org·12h
✍️Prompt Engineering
Flag this post