GPGPU

Feeds to Scour
SubscribedAll
Scoured 215 posts in 7.0 ms

Exploiting GPU Tensor Cores from Java using Babylon [Juan Fumero]

 ⚡CUDA
openjdk.org··r/java

CUDA-Oxide 0.2 Brings Early Improvements To Pure Rust CUDA Kernels

 🔀Parallel Computing
phoronix.com·

Exploiting GPU Tensor Cores from Java using Babylon

 🔀Parallel Computing
inside.java·

Framework Desktop AMD 395+ (rdna 3.5) cannot run confyui err Fix 2026

 ⚡CUDA  Content type: Blog
runaihome.com··DEV

Release TorchCodec 0.14: HDR Video Decoding for CPU & CUDA, and Fast Wav Decoder · meta-pytorch/torchcodec

 🔀Parallel Computing  Content type: Code
github.com··Hacker News

NVIDIA Nsight Compute

 🔀Parallel Computing
developer.nvidia.com·

APEX4: Efficient Pure W4A4 LLM Inference via Intra-SM Compute Rebalancing

 ⚡CUDA  Content type: Academic
arxiv.org·

NVIDIA chip powers local AI workloads

 ⚡CUDA
edn.com·

Flatpak 1.18 adds AMD ROCm support, improved error output, and faster Fish shell start-up

 ⚡CUDA
alternativeto.net·

Full Context on a Vulkan-Only Strix Halo: The Decode-Drop Reproduces, but the Sweet Spot Moves

 🎯Low Latency
thefrontierlab.ai··Hacker News

HydraMPP: A lightweight library for distributed massive parallel processing in Python - threading at scale.

 🔀Parallel Computing  Content type: Academic
biorxiv.org·

Core Automation co-founder Jerry Tworek jokes that Nvidia's CUDA translates to miracles in Polish

 🔀Parallel Computing
digg.com·

NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI

 🤖ML Systems  Content type: Blog
blogs.nvidia.com·

NVIDIA’s New RTX Spark Superchip Changes Everything for On-the-Go 12K Video Editing and 3D Rendering

 🔀Parallel Computing
canonrumors.com·

New comment by bhvk08 in "Ask HN: Who wants to be hired? (June 2026)"

 🔀Parallel Computing  Content type: Discussion

AMD's Lemonade SDK For Local AI Adds NVIDIA CUDA Support

 🔀Parallel Computing
phoronix.com·

1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM

 🔀Parallel Computing
smolhub.com··r/LocalLLaMA

DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200

 🤖ML Systems  Content type: News

Nvidia RTX Spark: The $2,900 Floor Tells You Everything

 🔀Parallel Computing  Content type: Blog  Content type: Discussion
tildalice.io·

From GPU to Token: The 8-Layer Observability Stack for AI Infrastructure

 🔀Parallel Computing  Content type: Blog
jimmysong.io·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help