GPU Assembly, CUDA ISA, Kernel Optimization, Low-level Programming

Co-Simulation Framework for Parallel DNN Execution on Chiplet-Based Systems (UW–Madison, Washington State)
semiengineering.com·1d
🌊CUDA Streams
Flag this post
Lazy loading isn't the magic pill to fix AI Inference
tensorfuse-docs.mintlify.dev·15h·
Discuss: Hacker News
🚀MLOps
Flag this post
Dissecting my MiniBanners program – part 1
subethasoftware.com·1d
✂️CUTLASS
Flag this post
Reforging the ReScript Build System
rescript-lang.org·14h·
🏗️Build Optimization
Flag this post
Real-time stock volatility prediction with deep learning on a time-series DB
medium.com·22h·
Discuss: Hacker News
ONNX Runtime
Flag this post
I'd love to see this one laptop feature on all motherboards
xda-developers.com·10h
🏗️Build Optimization
Flag this post
Nvidia, Deutsche Telekom strike €1B partnership for a data center in Munich
techcrunch.com·16h·
Discuss: Hacker News
🏎️TensorRT
Flag this post
AMD confirms its separate drivers for RDNA 1/2 and RDNA 3/4 GPUs will roll out at the same time
tweaktown.com·1d
⏱️CUDA Events
Flag this post
Disciplined Biconvex Programming
arxiv.org·1d
📉Model Quantization
Flag this post
How to build a Heapless Vector using `MaybeUninit<T>` for Better Performance.
dev.to·17h·
Discuss: DEV
🦀PyO3
Flag this post
Minimalistic CLAUDE.md for new projects: Follow SOLID, DRY, YAGNI, KISS
reddit.com·3h·
Discuss: r/ClaudeAI
🏗️Build Optimization
Flag this post
(PR) Giga Computing Announces Worldwide Availability of Its NVIDIA RTX PRO Server
techpowerup.com·7h
🔍Nsight
Flag this post
The 2-hour upgrade: coder engineer
dev.to·1h·
Discuss: DEV
🤖AI Coding Tools
Flag this post
Introducing Spira - Making a Shell #0
github.com·1d·
Discuss: DEV
💡LSP
Flag this post
Enforcing Architecture in an Agent-Driven Codebase
phoebe.work·1d·
Discuss: Hacker News
🏗️Build Optimization
Flag this post
A $1 Billion Reason to Buy AMD Stock Now
finance.yahoo.com·1h
🔍Nsight
Flag this post
Why stop at 1M tokens when you can have 10M?
news.ycombinator.com·18h·
Discuss: Hacker News
Flash Attention
Flag this post
Self-Improving Vision-Language-Action Models with Data Generation via Residual RL
arxiv.org·1d
🏎️TensorRT
Flag this post
Planning > Agents: Getting Reliable Code from LLMs
repoprompt.com·4h·
Discuss: Hacker News
🤖AI Coding Tools
Flag this post