GPU Computing

RightNow-AI/AutoMegaKernel: An agent harness that compiles a model into one provably-correct, self-retargeting CUDA megakernel and self-tunes it past cuBLAS at batch-1 LLM decode.

 🟢CUDA  Content type: Code
github.com··Hacker News

Nvidia CEO Jensen Huang says the GTX 1080 is "one of my favorites" and a GPU that "changed everything"

 💾Shared Memory

1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM

 ⏱️Prefill Decoding
smolhub.com··r/LocalLLaMA

nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16

 ☁️Cloud Infrastructure

The China Chip Strategy That Is Backfiring on America

 💾Shared Memory
techpolicy.press·

Train Models Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell

 🔢FP8 Training  Content type: News  Content type: Blog
developer.nvidia.com·

APEX4: Efficient Pure W4A4 LLM Inference via Intra-SM Compute Rebalancing

 🟢CUDA  Content type: Academic
arxiv.org·

Core Automation co-founder Jerry Tworek jokes that Nvidia's CUDA translates to miracles in Polish

 💾Shared Memory
digg.com·

DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200

 🧠Inference Engineering  Content type: News

Jensen Huang says 'every edge device will become autonomous' — Nvidia maps one computing pattern from the cloud to robotics

 🧠HBM Bandwidth
tomshardware.com·

Local AI has a hardware accessibility problem, and the answer to it isn't RTX Spark

 💰Inference Cost
xda-developers.com·

Nvidia unveils RTX Spark, advancing AI integration in Windows PCs

 💾Shared Memory
cryptobriefing.com·

From GPU to Token: The 8-Layer Observability Stack for AI Infrastructure

 💰Inference Cost  Content type: Blog
jimmysong.io·

Can reinventing the PC actually make a difference? NVIDIA thinks it does

 💾Shared Memory
crnasia.com·

2x GH200 for LLM inference, Part 2: vLLM, DeepSeek V4 Flash, and MTP

 🧠Inference Engineering  Content type: Blog
dnhkng.github.io·

Microsoft's Surface Laptop Ultra Announced! #shorts

 🧠HBM Bandwidth  Content type: Video
youtube.com·

AI Pains and Gains

 💰Inference Cost
thewirechina.com·

Unreleased RTX 3050 Ti graphics card spotted in the wild, GA106 GPU with 6GB VRAM

 🧵Warp Scheduling  Content type: News
tweaktown.com·

Founders on the frontiers of space and robotics show off their gadgets and tell the stories behind them

 🏗️Platform Engineering
geekwire.com·

New comment by ellis0n in "Ask HN: Who wants to be hired? (June 2026)"

 💾Shared Memory  Content type: Discussion