🎯 GPU Kernels - miterion · Scour

Beyond a Single Queue: Multi-Level-Multi-Queue as an Effective Design for SSSP problems on GPUs

arxiv.org·3d

🌊CUDA Streams

AMD Ryzen 7 9850X3D vs Ryzen 7 9800X3D faceoff — an extra $30 buys you very little performance

tomshardware.com

·4h

a Linux VM manager with easy GPU-passthrough and more

vm-curator.org·1d·

Discuss: Hacker News

NVIDIA DGX Spark Powers Big Projects in Higher Education

blogs.nvidia.com·2d

Moss: A Linux-compatible Rust async kernel, 3 months on

news.ycombinator.com·1d·

Discuss: Hacker News

Oxide plans new rack attack, packing in Zen 5 CPUs and DDR5 RAM

theregister.com·17h·

Discuss: Hacker News

🧠CPU Architecture

Breaking the Tractability Barrier: A Generic Low-Level Solver for NP-Hard Instances (N=63) on Commodity 64-Bit Silicon

zenodo.org·1d·

Discuss: Hacker News

🎯Tensor Cores

BOute: Cost-Efficient LLM Serving with Heterogeneous LLMs and GPUs via Multi-Objective Bayesian Optimization

arxiv.org·2d

From hand-tuned to generated: A reproducible Triton GPU kernel benchmark across different vendors

next.redhat.com·1d

⏱️CUDA Events

NVIDIA RTX 5070 vs Radeon RX 9070: Which GPU should you buy in 2026?

tech.sportskeeda.com·3h

The 5 Distributed Training Methods: How to Train Models Too Large for One GPU

pub.towardsai.net

·1d

OpenAI GPT-5.3-Codex-Spark Now Running at 1K Tokens Per Secondon BIG Cerebras Chips

servethehome.com·21h·

Discuss: Hacker News

⚡Flash Attention

Nvidia Deepens AI Inference Push With Groq Deal And Rubin Platform

finance.yahoo.com·1d

AI, GPU, And HPC Data Centers: The Infrastructure Behind Modern AI

semiengineering.com·2d

⏱️CUDA Events

Show HN: GPU ROI simulator based on token usage and model architecture

axiomos.ai·4d·

Discuss: Hacker News

📈GPU Occupancy

Nvidia’s new technique cuts LLM reasoning costs by 8x without losing accuracy

venturebeat.com·1d·

Discuss: r/LocalLLaMA

Building a Zero-Dependency secp256k1 CUDA Engine from Scratch (2.5B ops/SEC)

github.com·3d·

Discuss: Hacker News

Nvidia-Leased Data Center Wraps Up In-Demand $3.8B Bond

bloomberg.com

·15h

Linux 7.0 MM Changes Bring Some Very Nice Performance Optimizations

phoronix.com·1d

📊Profiling Tools

NVIDIA RTX 6000D PCB Spotted With 84GB GDDR7 Using 28x 3GB Chip Configuration

eteknix.com·20h

Loading more...