🎮 GPU Computing - nayyara.airlangga · Scour

RightNow-AI/AutoMegaKernel: An agent harness that compiles a model into one provably-correct, self-retargeting CUDA megakernel and self-tunes it past cuBLAS at batch-1 LLM decode.

🟢CUDA Code

github.com··Hacker News

Nvidia CEO Jensen Huang says the GTX 1080 is "one of my favorites" and a GPU that "changed everything"

💾Shared Memory

pcguide.com··r/pcmasterrace

1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM

⏱️Prefill Decoding

smolhub.com··r/LocalLLaMA

nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16

☁️Cloud Infrastructure

huggingface.co··Hacker News, Hacker News, r/LocalLLaMA

The China Chip Strategy That Is Backfiring on America

💾Shared Memory

techpolicy.press·

Train Models Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell

🔢FP8 Training News Blog

developer.nvidia.com·

APEX4: Efficient Pure W4A4 LLM Inference via Intra-SM Compute Rebalancing

🟢CUDA Academic

Core Automation co-founder Jerry Tworek jokes that Nvidia's CUDA translates to miracles in Polish

💾Shared Memory

DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200

🧠Inference Engineering News

newsletter.semianalysis.com··Hacker News

Jensen Huang says 'every edge device will become autonomous' — Nvidia maps one computing pattern from the cloud to robotics

🧠HBM Bandwidth

tomshardware.com·

Local AI has a hardware accessibility problem, and the answer to it isn't RTX Spark

💰Inference Cost

xda-developers.com·

Nvidia unveils RTX Spark, advancing AI integration in Windows PCs

💾Shared Memory

cryptobriefing.com·

From GPU to Token: The 8-Layer Observability Stack for AI Infrastructure

💰Inference Cost Blog

Can reinventing the PC actually make a difference? NVIDIA thinks it does

💾Shared Memory

2x GH200 for LLM inference, Part 2: vLLM, DeepSeek V4 Flash, and MTP

🧠Inference Engineering Blog

dnhkng.github.io·

Microsoft's Surface Laptop Ultra Announced! #shorts

🧠HBM Bandwidth Video

AI Pains and Gains

💰Inference Cost

thewirechina.com·

Unreleased RTX 3050 Ti graphics card spotted in the wild, GA106 GPU with 6GB VRAM

🧵Warp Scheduling News

tweaktown.com·

Founders on the frontiers of space and robotics show off their gadgets and tell the stories behind them

🏗️Platform Engineering

New comment by ellis0n in "Ask HN: Who wants to be hired? (June 2026)"

💾Shared Memory Discussion

news.ycombinator.com··Hacker News
