GPU Computing

Feeds to Scour
SubscribedAll
Scoured 178 posts in 6.7 ms

Exploiting GPU Tensor Cores from Java using Babylon [Juan Fumero]

 Concurrency
openjdk.org··Lobsters, r/java

CUDA-Oxide 0.2 Brings Early Improvements To Pure Rust CUDA Kernels

 🐧Kernel
phoronix.com·

Making FlashAttention-4 faster for inference

 ✍️Prompt Engineering  Content type: Blog
modal.com··Hacker News

Gerrymandering the Warp: Non-Control-Data Attacks on CUDA Collective Decision

 Concurrency  Content type: Academic
arxiv.org·

AmrDeveloper/Turtle: A Heterogeneous Pythonic 🐍 language to practice targeting CPU & GPU in the same program on Mobile Devices Influenced by Python, Mojo and CUDA

 ⚙️Systems Programming  Content type: Code
github.com··Hacker News

Framework Desktop AMD 395+ (rdna 3.5) cannot run confyui err Fix 2026

 🤖AI  Content type: Blog
runaihome.com··DEV

GPUsnek is Python on nVidia’s CUDA

 🐧Kernel  Content type: Blog
blog.adafruit.com·

Exploiting GPU Tensor Cores from Java using Babylon

 LLM Inference
inside.java·

NVIDIA RTX Pro 6000 Blackwell: 96GB GDDR7 and the End of VRAM Anxiety

 LLM Inference  Content type: Blog
fitservers.com·
Less-relevant results

Polars GPU engine — cudf 26.06.01 documentation

 📈Performance Engineering  Content type: Reference
docs.rapids.ai··Hacker News

Vortex expands open RISC-V graphics

 🧠CXL
jonpeddie.com·

CommBench: Can LLMs Write Correct and Efficient GPU Communication Code?

 🤖LLMs

NVIDIA's RTX 5060 May Finally Get The VRAM Upgrade Gamers Wanted

 🧠CXL  Content type: News
hothardware.com·

NVIDIA at Computex 2026: RTX Spark Gaming Hands-On, DLSS 4.5, and More

 🧠CXL
techpowerup.com·

NVIDIA chip powers local AI workloads

 📈Performance Engineering
edn.com·

From GPU to Token: The 8-Layer Observability Stack for AI Infrastructure

 ✍️Prompt Engineering  Content type: Blog
jimmysong.io·

Nvidia's RTX Spark is a chip unlike any other, and it could change Windows laptops forever

 🧠CXL
xda-developers.com·

Nvidia GeForce RTX 50 Super GPUs may launch in early 2027 with 50% more VRAM

 🧠CXL
club386.com·

1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM

 📈Performance Engineering
smolhub.com··r/LocalLLaMA

Vortex 3.0 Released As Full-Stack, Open-Source RISC-V GPU Now With 3D Pipeline

 🧠CXL
phoronix.com·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help