🧮 Tensor Cores - dane8036 · Scour

APEX4: Efficient Pure W4A4 LLM Inference via Intra-SM Compute Rebalancing

🔲AI,GPU IC, SOC IC Academic

Exploiting GPU Tensor Cores from Java using Babylon [Juan Fumero]

🔲AI,GPU IC, SOC IC

openjdk.org · Lobsters, r/java

Making FlashAttention-4 faster for inference

🔲AI,GPU IC, SOC IC Blog

modal.com · Hacker News

The Inference Alpha: Maximizing Frontier Models on AMD

🧠NPU Blog

digitalocean.com

NVIDIA RTX Pro 6000 Blackwell: 96GB GDDR7 and the End of VRAM Anxiety

🔲AI,GPU IC, SOC IC Blog

fitservers.com

Less-relevant results

Release v8.4.66 - Add `nvidia-ml-py` to pyproject.toml (#23922) · ultralytics/ultralytics

📱Edge AI Code

Intel's Open Image Denoise 2.5 Delivers Solid Performance Improvements For GPUs

Train Models Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell

🧠NPU News Blog

developer.nvidia.com

Discrete Diffusion Modelling by Estimating the Ratios of the Data Distribution

🧠NPU News Blog

leetarxiv.substack.com · Substack, r/programming

Framework Desktop AMD 395+ (RDNA 3.5) cannot run ComfyUI err Fix 2026

🔲AI,GPU IC, SOC IC Blog

runaihome.com · DEV

NVIDIA chip powers local AI workloads

🔲AI,GPU IC, SOC IC

Rebellions Bets on Memory-Centric Architecture as it Weighs IPO Options

🔲AI,GPU IC, SOC IC News

TileFuse: A Fused Mixed-Precision Kernel Library for Efficient Quantized LLM Inference on AMD NPUs

🔲AI,GPU IC, SOC IC Academic

2x GH200 for LLM inference, Part 2: vLLM, DeepSeek V4 Flash, and MTP

🔲AI,GPU IC, SOC IC Blog

dnhkng.github.io

NVIDIA at Computex 2026: RTX Spark Gaming Hands-On, DLSS 4.5, and More

🔲AI,GPU IC, SOC IC

techpowerup.com

Anatomy of a high-performance EP kernel

🔲AI,GPU IC, SOC IC Blog

fergusfinn.com · Hacker News

Two Leaps to 1000 Tokens/s on a 1T-Parameter Model: On Inference Systems, Execution Boundaries, and Co-Design

🤖Agentic Engineering Blog

tilert.ai · Hacker News

Nvidia GeForce RTX 50 Super GPUs may launch in early 2027 with 50% more VRAM

🔲AI,GPU IC, SOC IC

NVIDIA's RTX 5060 May Finally Get The VRAM Upgrade Gamers Wanted

🔲AI,GPU IC, SOC IC News

hothardware.com
