GPU Assembly, CUDA ISA, Kernel Optimization, Low-level Programming

New comment by kanwisher in "Ask HN: Who is hiring? (November 2025)"
tensorfleet.net·9h·
Discuss: Hacker News
🤖AI Coding Tools
Radxa Launches AICore DX-M1 Edge AI Accelerator with DeepX DX-M1 NPU
linuxgizmos.com·2d
🎯Tensor Cores
AMD Radeon RX 9070 XT Is Finally Available For $599 With A Free Bonus
hothardware.com·1d
🎮NVIDIA
Makefile vs. YAML: Modernizing verification simulation flows
edn.com·1d
🏗️Build Optimization
Samsung and Nvidia join forces for AI megafactory with 50,000 GPUs
techspot.com·23h
🔍Nsight
Using GNU toolchain for Windows kernel-mode drivers
dev.to·1d·
Discuss: DEV
🏗️Build Systems
Built Datapizza-AI in PHP on 2011 Raspberry Pi: Edge AI Without GPU
dev.to·2d·
Discuss: DEV
✂️CUTLASS
Mind’s Eye Flow Engine — Turning Postgres Into a Thinking System
dev.to·2h·
Discuss: DEV
🔗NCCL
I want to run 8x 5060 ti to run gpt-oss 120b
reddit.com·1d·
Discuss: r/LocalLLaMA
📈Occupancy Optimization
Exchange experience with Claude, 20+ years experience professional using claude code.
reddit.com·11h·
Discuss: r/ClaudeAI
🤖AI Coding Tools
Scaling Coding-Agent RL to 32x H100s. 160% Improvement on Stanford's TBench
github.com·1d·
🤖AI Coding Tools
Predicting Encoding Energy from Low-Pass Anchors for Green Video Streaming
arxiv.org·13h
🔗Kernel Fusion
Honest take: I tested 12+ AI vibe coding tools, but this one actually surprised me
vibe.forem.com·6h·
Discuss: DEV
🤖AI Coding Tools
I made a basic arcade machine with this ESP32-powered display
xda-developers.com·1d
Flash Attention
SK Hynix is already planning Gen 7 SSDs and GDDR7-Next VRAM, as it reveals future schedule
club386.com·6h
🎯GPU Kernels
Real-time Semantic Segmentation for AR Glasses: Dynamic Occlusion Handling via Bayesian Fusion
dev.to·9h·
Discuss: DEV
🏎️TensorRT
Schaltwerk – The IDE Without Editor
github.com·12h·
Discuss: Hacker News
💻CLI Tools
On Designing Low-Latency Systems for High-Traffic Environments
hackernoon.com·1d
🌐Distributed Computing
GIGABYTE sets record DDR5 speed OC world record on Z890 AORUS Tachyon ICE mobo with 13,034 MT/s
tweaktown.com·11h
⏱️Benchmarking
Understanding Solidity Transparent Upgradeable Proxy Pattern - A Practical Guide
github.com·3h·
Discuss: DEV
💡LSP