GPU Computing

Feeds to Scour
SubscribedAll
Scoured 174 posts in 4.9 ms

Exploiting GPU Tensor Cores from Java using Babylon [Juan Fumero]

 💻Computer Architecture
openjdk.org··Lobsters, r/java

A BIOS update won't fix a board-specific ROCm bug on Strix Halo

 💾Cache Coherence

frankkk96/FlashQwen: From-scratch C++/CUDA inference engine for Qwen3-8B, with zero external libraries

 🐧Operating Systems  Content type: Code
github.com·
Less-relevant results

Training Cycle Halved: LoongForge End-to-End Optimization for GR00T N1.6 Delivers 2.3× Throughput

 🐧Operating Systems

Framework Desktop AMD 395+ (rdna 3.5) cannot run confyui err Fix 2026

 📐SIMD  Content type: Blog
runaihome.com··DEV

Making FlashAttention-4 faster for inference

 🐧Operating Systems  Content type: Blog

nomp: A Framework for Building Domain Specific Compilers

 🖥️HPC  Content type: Academic
arxiv.org·

Vortex 3.0 Released As Full-Stack, Open-Source RISC-V GPU Now With 3D Pipeline

 💻Computer Architecture

Nvidia GeForce RTX 2080 Ti Super prototype shows what could have been, with 4,608 CUDA cores

 🔲FPGA
club386.com·

Exploiting GPU Tensor Cores from Java using Babylon

 📐SIMD

GPUsnek is Python on nVidia’s CUDA

 🐧Operating Systems  Content type: Blog
blog.adafruit.com·

Nvidia’s RTX Spark to fuel Adobe creative apps

 📐SIMD
jonpeddie.com·

How to fit Qwen 3.6 35B A3B into 16GB of VRAM, & run it with Llama.cpp on an RTX 3080

 🐧Operating Systems
autodidacts.io·

Flatpak 1.18 adds AMD ROCm support, improved error output, and faster Fish shell start-up

 🐧Operating Systems
alternativeto.net·

Polars GPU engine — cudf 26.06.01 documentation

 📐SIMD  Content type: Reference

NVIDIA RTX Pro 6000 Blackwell: 96GB GDDR7 and the End of VRAM Anxiety

 📐SIMD  Content type: Blog
fitservers.com·

RTX 5080 + RTX 3090 Setup: 80+ Tok/s on Qwen 3.6 27B Q8

 🐧Operating Systems  Content type: Blog

Nvidia's RTX Spark is a chip unlike any other, and it could change Windows laptops forever

 Concurrency
xda-developers.com·

Introducing Piper: A Programmable Distributed Training System

 🐧Operating Systems  Content type: Academic  Content type: Blog

Redditor buys RTX 2080 Ti Super engineering sample on eBay, has the same number of cores as an RTX Titan but half the VRAM

 🔲FPGA  Content type: News
tweaktown.com·

Keyboard Shortcuts

Navigation

Next / previous item
j/k
Open post
oorEnter
Preview post
v

Post Actions

Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s

Recommendations

Add interest / feed
Enter
Not interested
x

Go to

Home
gh
Interests
gi
Feeds
gf
Likes
gl
History
gy
Changelog
gc
Settings
gs
Browse
gb
Search
/

General

Show this help
?
Submit feedback
!
Close modal / unfocus
Esc

Press ? anytime to show this help