GPU Assembly, CUDA ISA, Kernel Optimization, Low-level Programming

OpenLoRa: Validating LoRa Implementations Through an Open-Sourced Framework
usenix.org·3h·
Discuss: Hacker News
🚀MLOps
CPUs and GPUs to Become More Expensive After TSMC Price Hike in 2026
guru3d.com·14h·
Discuss: Hacker News
📈Occupancy Optimization
Lazy loading isn't the magic pill to fix AI Inference
tensorfuse-docs.mintlify.dev·21h·
Discuss: Hacker News
🚀MLOps
Real-time stock volatility prediction with deep learning on a time-series DB
medium.com·1d·
Discuss: Hacker News
ONNX Runtime
The Glorious Misadventures of a Linux-illiterate
joshgriffiths.site·15h
🏗️Build Systems
Nvidia, Deutsche Telekom strike €1B partnership for a data center in Munich
techcrunch.com·21h·
Discuss: Hacker News
🏎️TensorRT
Reforging the ReScript Build System
rescript-lang.org·19h·
🏗️Build Optimization
Enforcing Architecture in an Agent-Driven Codebase
phoebe.work·1d·
Discuss: Hacker News
🏗️Build Optimization
A $1 Billion Reason to Buy AMD Stock Now
finance.yahoo.com·7h
🔍Nsight
Self-Improving Vision-Language-Action Models with Data Generation via Residual RL
arxiv.org·1d
🏎️TensorRT
Uncrossed Multiflows and Applications to Disjoint Paths
arxiv.org·1d
📊CUDA Graphs
KTransformers Open Source New Era: Local Fine-tuning of Kimi K2 and DeepSeek V3
reddit.com·23h·
Discuss: r/LocalLLaMA
ONNX Runtime
I'm working on a project I've been dreaming about for months and it feels good
github.com·13h·
Discuss: r/webdev
🤖AI Coding Tools
Planning > Agents: Getting Reliable Code from LLMs
repoprompt.com·10h·
Discuss: Hacker News
🤖AI Coding Tools
LeMiCa: Lexicographic Minimax Path Caching for Efficient Diffusion-Based Video Generation
arxiv.org·1d
Flash Attention
High-Throughput HPLC Method Optimization via Bayesian Neural Network & Predictive Maintenance
dev.to·7h·
Discuss: DEV
⏱️Benchmarking
1 billion JSON records, 1-second query response: Apache Doris vs. ClickHouse, Elasticsearch, and PostgreSQL
dev.to·16h·
Discuss: DEV
🐕Ruff
Learning Sparse Approximate Inverse Preconditioners for Conjugate Gradient Solvers on GPUs
arxiv.org·2d
🔗NCCL
The 2-hour upgrade: coder engineer
dev.to·6h·
Discuss: DEV
🤖AI Coding Tools