Crushing ML Latency: The (Un)Official Best Practices for Systems Optimisation
pub.towardsai.net·5h
🚀Performance
Flag this post
Generalizing Test-Time Compute-Optimal Scaling as an Optimizable Graph
huggingface.co·6h·
Discuss: Hacker News
🎴TAO
Flag this post
Moving past speculation: How deterministic CPUs deliver predictable AI performance
venturebeat.com·3d
🏗Computer Architecture
Flag this post
flowengineR: A Modular and Extensible Framework for Fair and Reproducible Workflow Design in R
arxiv.org·1d
🔧Data Engineering
Flag this post
Prog8
github.com·18h·
Discuss: Hacker News
💾Retro Computing
Flag this post
How to build a Heapless Vector using `MaybeUninit<T>` for Better Performance.
dev.to·22h·
Discuss: DEV
⚠️Rust Unsafe
Flag this post
My First Multi-GPU Kernel: Writing All-to-All for AMD MI300X
gau-nernst.github.io·2d·
Discuss: Hacker News
Hardware Acceleration
Flag this post
NumPy for Absolute Beginners: A Project-Based Approach to Data Analysis
towardsdatascience.com·14h
🔢NumPy
Flag this post
Why is AI Generated Rust slow when compared with Go/C#/Node/JavaScript
srid68.github.io·19h·
Discuss: Hacker News
🦀Rust
Flag this post
Detailed Technical Documentation on AI Implementation Logic (Taking Large Language Models as an Example )
nbtab.com·1d·
Discuss: DEV
📱Edge AI
Flag this post
Running MiniMax-M2 locally - Existing Hardware Advice
reddit.com·17h·
Discuss: r/LocalLLaMA
🚀Performance
Flag this post
Vectorizing for Fun and Performance
ibm.com·6d·
Discuss: Hacker News
🔀SIMD Programming
Flag this post
[Talk] Improving the Incremental System in the Rust Compiler
blog.goose.love·14h
🔨Incremental Compilation
Flag this post
'No Free Lunch: Deconstruct Efficient Attention with MiniMax M2'
lmsys.org·1d
📱Edge AI
Flag this post
Minimalistic CLAUDE.md for new projects: Follow SOLID, DRY, YAGNI, KISS
reddit.com·8h·
Discuss: r/ClaudeAI
🔨Incremental Compilation
Flag this post
Why stop at 1 million tokens when you can have 10? My journey to extreme context on a gaming GPU. [P]
reddit.com·22h·
📱Edge AI
Flag this post
Disassembling Terabytes of Random Data with Zig and Capstone to Prove a Point
jstrieb.github.io·37m·
Discuss: Hacker News
🔓Binary Exploitation
Flag this post
Enabling Trillion-Parameter Models on AWS EFA
research.perplexity.ai·10h·
Discuss: Hacker News
Hardware Acceleration
Flag this post
Algorithmic Alchemy: Transmuting Dynamic Programming with Gradients by Arvind Sundararajan
dev.to·17h·
Discuss: DEV
📊Dynamic Programming
Flag this post
Boosting React Performance: A Guide to Optimization
dev.to·40m·
Discuss: DEV
📊Performance Tools
Flag this post