GPU Computing

Feeds to Scour
SubscribedAll
Scoured 176 posts in 7.9 ms

Exploiting GPU Tensor Cores from Java using Babylon [Juan Fumero]

 💻Computer Architecture
openjdk.org··Lobsters, r/java

A BIOS update won't fix a board-specific ROCm bug on Strix Halo

 💾Cache Coherence
Less-relevant results

Training Cycle Halved: LoongForge End-to-End Optimization for GR00T N1.6 Delivers 2.3× Throughput

 🐧Operating Systems

WarpGuard: Protected-Site Control-Flow Integrity for CUDA SASS Binaries

 🐧Operating Systems  Content type: Academic
arxiv.org·

Framework Desktop AMD 395+ (rdna 3.5) cannot run confyui err Fix 2026

 📐SIMD  Content type: Blog
runaihome.com··DEV

AmrDeveloper/Turtle: A Heterogeneous Pythonic 🐍 language to practice targeting CPU & GPU in the same program on Mobile Devices Influenced by Python, Mojo and CUDA

 📐SIMD  Content type: Code
github.com··Hacker News

Vortex 3.0 Released As Full-Stack, Open-Source RISC-V GPU Now With 3D Pipeline

 💻Computer Architecture

Making FlashAttention-4 faster for inference

 🐧Operating Systems  Content type: Blog
modal.com··Hacker News

Exploiting GPU Tensor Cores from Java using Babylon

 📐SIMD

Nvidia GeForce RTX 2080 Ti Super prototype shows what could have been, with 4,608 CUDA cores

 🔲FPGA
club386.com·

GPUsnek is Python on nVidia’s CUDA

 🐧Operating Systems  Content type: Blog
blog.adafruit.com·

How to fit Qwen 3.6 35B A3B into 16GB of VRAM, & run it with Llama.cpp on an RTX 3080

 🐧Operating Systems
autodidacts.io·

Vortex expands open RISC-V graphics

 💻Computer Architecture
jonpeddie.com·

Polars GPU engine — cudf 26.06.01 documentation

 📐SIMD  Content type: Reference

Flatpak 1.18 adds AMD ROCm support, improved error output, and faster Fish shell start-up

 🐧Operating Systems
alternativeto.net·

CommBench: Can LLMs Write Correct and Efficient GPU Communication Code?

 🐧Operating Systems

NVIDIA RTX Pro 6000 Blackwell: 96GB GDDR7 and the End of VRAM Anxiety

 📐SIMD  Content type: Blog
fitservers.com·

RTX 5080 + RTX 3090 Setup: 80+ Tok/s on Qwen 3.6 27B Q8

 🐧Operating Systems  Content type: Blog

Nvidia's RTX Spark is a chip unlike any other, and it could change Windows laptops forever

 Concurrency
xda-developers.com·

Introducing Piper: A Programmable Distributed Training System

 🐧Operating Systems  Content type: Academic  Content type: Blog

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help