The Real Cost of LLM Inference: Memory Bandwidth, Not FLOPs
dev.to·1d·
Discuss: DEV
🗺️Region Inference
Flag this post
Multi-Core Architecture Optimized For Time-Predictable Neural Network Inference (FZI, KIT)
semiengineering.com·1d
🔮CPU Branch Prediction
Flag this post
An overview of memory management in Go (2021)
medium.com·14h·
Discuss: Hacker News
📚Stack Data Structures
Flag this post
Trying Out C++26 Executors
mropert.github.io·12h·
🔮Speculative Execution
Flag this post
Parallel C++ for Scientific Applications: Linear Algebra in C++
reddit.com·1d·
Discuss: r/cpp
🔀SIMD Programming
Flag this post
Accelerating Controllable Generation via Hybrid-grained Cache
arxiv.org·6d
Cache-Aware Algorithms
Flag this post
SK Hynix massively increases DRAM production, but the global memory bottleneck persists
igorslab.de·3h
🔮Speculative Execution
Flag this post
Radxa Unveils Solder-Down rCore Module Line With RK3308 and IQ-9075 Edge AI Variants
linuxgizmos.com·10h
🔌Microcontrollers
Flag this post
Why I Ditched Caffeine for JCacheX in My Spring Boot Microservices
dev.to·2h·
Discuss: DEV
🔗Weak References
Flag this post
Zoomer: Powering AI Performance at Meta’s Scale Through Intelligent Debugging and Optimization
engineering.fb.com·1d
📈Performance Tools
Flag this post
Strix Halo, Debian 13@6.16.12&6.17.8, Qwen3Coder-Q8 CTX<=131k, llama.cpp@Vulkan&ROCm, Power & Efficiency
i.redd.it·7h·
💪ARM64 Backend
Flag this post
The Engineering Guide to Efficient LLM Inference: Metrics, Memory, and Mathematics
pub.towardsai.net·2d
🗺️Region Inference
Flag this post
Modern X86 Assembly Language Programming • Daniel Kusswurm & Matt Godbolt • GOTO 2025
youtube.com·2d
🔧Assembly DSLs
Flag this post
Meditations on geometric packing
shvbsle.in·22h
🌊Effect Rows
Flag this post
On Thread Synchronization : Part 1 - A deep dive into mutexes
sayujya-apte.github.io·22h·
Discuss: r/programming
🔗Concurrency Primitives
Flag this post
Making SLH-DSA 10x-100x Faster
conduition.io·8h
🔗Hash Algorithms
Flag this post
Challenges compiling old C++ code on modern Linux
smalldatum.blogspot.com·15h·
🗃️Query Compilation
Flag this post
Global Optimization: Finding the Needle in a Haystack – Faster by Arvind Sundararajan
dev.to·5h·
Discuss: DEV
🔍Search Algorithms
Flag this post
Live Webinar: Considerations When Architecting Your Next SoC: NoC with Arteris and Aion Silicon
semiwiki.com·2d
🔮CPU Branch Prediction
Flag this post