GPU Assembly, CUDA ISA, Kernel Optimization, Low-level Programming

Achieving analog precision via components and design, or just trim and go
edn.com·1d
⏱️Benchmarking
Flag this post
Nvidia Server-Maker Hon Hai’s Sales Rise 11% as Demand Persists
bloomberg.com·15h
🎮NVIDIA
Flag this post
How Amazon Search increased ML training twofold using AWS Batch for Amazon SageMaker Training jobs
aws.amazon.com·5h
🤖AI Coding Tools
Flag this post
Microsoft Ships 60,000 Nvidia GB300 GPUs to UAE Under Safeguarded Licenses
windowsforum.com·2d
🌊CUDA Streams
Flag this post
SUSE Enterprise Linux 16 is here, and its killer feature is digital sovereignty
zdnet.com·1d·
Discuss: Hacker News
🏗️Build Systems
Flag this post
Nvidia AI: A Revolutionary €1 Billion Boost For Germany's Digital Future
bitcoinworld.co.in·8h
🧠BF16
Flag this post
My computer has $18 million worth of RAM in it
aardvark.co.nz·6h
Flash Attention
Flag this post
Get Ready for Clojure, GPU, and AI in 2026 with CUDA 13.0
dragan.rocks·6d·
Discuss: Hacker News
⏱️CUDA Events
Flag this post
Curious about real local LLM workflows: What’s your setup?
reddit.com·15h·
Discuss: r/LocalLLaMA
🚀MLOps
Flag this post
Understanding Hetzner SSD VPS Performance and Best Practices
dev.to·6h·
Discuss: DEV
📈GPU Occupancy
Flag this post
Agentic DevOps: I Let GitHub Copilot Run My Entire CI/CD Pipeline (And Lived to Tell the Tale)
dev.to·16h·
Discuss: DEV
🤖AI Coding Tools
Flag this post
Mind the Gap: The API Layer That Shouldn’t Exist
dev.to·56m·
Discuss: DEV
💡LSP
Flag this post
Building Scalable Online Gaming Platforms: A Developer’s Look into Turnkey Tech Stacks
dev.to·1d·
Discuss: DEV
⏱️CUDA Events
Flag this post
KTransformers Open Source New Era: Local Fine-tuning of Kimi K2 and DeepSeek V3
reddit.com·1d·
Discuss: r/LocalLLaMA
ONNX Runtime
Flag this post
I'm working on a project I've been dreaming about for months and it feels good
github.com·1d·
Discuss: r/webdev
🤖AI Coding Tools
Flag this post
Geonum – geometric number library for unlimited dimensions with O(1) complexity
github.com·2d·
Discuss: Hacker News
✂️CUTLASS
Flag this post
LeMiCa: Lexicographic Minimax Path Caching for Efficient Diffusion-Based Video Generation
arxiv.org·1d
Flash Attention
Flag this post
AMD confirms its separate drivers for RDNA 1/2 and RDNA 3/4 GPUs will roll out at the same time
tweaktown.com·1d
⏱️CUDA Events
Flag this post
High-Throughput HPLC Method Optimization via Bayesian Neural Network & Predictive Maintenance
dev.to·18h·
Discuss: DEV
⏱️Benchmarking
Flag this post