Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
GPGPU
🎮 GPGPU
GPU computing, CUDA, OpenCL, parallel processing
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
215
posts in
7.0
ms
Exploiting
GPU
Tensor
Cores
from Java using Babylon [Juan Fumero]
Â
âš¡
CUDA
openjdk.org
·
1d
1 day ago
·
r/java
Actions for Exploiting GPU Tensor Cores from Java using Babylon [Juan Fumero]
CUDA-Oxide
0.2 Brings Early Improvements To Pure Rust
CUDA
Kernels
Â
🔀
Parallel Computing
phoronix.com
·
5d
5 days ago
Actions for CUDA-Oxide 0.2 Brings Early Improvements To Pure Rust CUDA Kernels
Exploiting
GPU
Tensor
Cores
from Java using Babylon
Â
🔀
Parallel Computing
inside.java
·
23h
23 hours ago
Actions for Exploiting GPU Tensor Cores from Java using Babylon
Framework Desktop AMD 395+ (rdna 3.5) cannot run confyui err Fix 2026
Â
âš¡
CUDA
Â
Content type:
Blog
runaihome.com
·
2d
2 days ago
·
DEV
Actions for Framework Desktop AMD 395+ (rdna 3.5) cannot run confyui err Fix 2026
Release TorchCodec 0.14: HDR Video Decoding for CPU &
CUDA
, and Fast Wav Decoder · meta-pytorch/torchcodec
Â
🔀
Parallel Computing
Â
Content type:
Code
github.com
·
9h
9 hours ago
·
Hacker News
Actions for Release TorchCodec 0.14: HDR Video Decoding for CPU & CUDA, and Fast Wav Decoder · meta-pytorch/torchcodec
NVIDIA Nsight
Compute
Â
🔀
Parallel Computing
developer.nvidia.com
·
6d
6 days ago
Actions for NVIDIA Nsight Compute
APEX4: Efficient Pure W4A4 LLM Inference via Intra-SM
Compute
Rebalancing
Â
âš¡
CUDA
Â
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for APEX4: Efficient Pure W4A4 LLM Inference via Intra-SM Compute Rebalancing
NVIDIA chip powers local AI workloads
Â
âš¡
CUDA
edn.com
·
5h
5 hours ago
Actions for NVIDIA chip powers local AI workloads
Flatpak 1.18 adds AMD
ROCm
support, improved error output, and faster Fish shell start-up
Â
âš¡
CUDA
alternativeto.net
·
1d
1 day ago
Actions for Flatpak 1.18 adds AMD ROCm support, improved error output, and faster Fish shell start-up
Full Context on a Vulkan-Only Strix Halo: The Decode-Drop Reproduces, but the Sweet Spot Moves
Â
🎯
Low Latency
thefrontierlab.ai
·
6d
6 days ago
·
Hacker News
Actions for Full Context on a Vulkan-Only Strix Halo: The Decode-Drop Reproduces, but the Sweet Spot Moves
HydraMPP: A lightweight library for distributed
massive
parallel
processing
in Python - threading at scale.
Â
🔀
Parallel Computing
Â
Content type:
Academic
biorxiv.org
·
2d
2 days ago
Actions for HydraMPP: A lightweight library for distributed massive parallel processing in Python - threading at scale.
Core
Automation co-founder Jerry Tworek jokes that Nvidia's
CUDA
translates to miracles in Polish
Â
🔀
Parallel Computing
digg.com
·
6d
6 days ago
Actions for Core Automation co-founder Jerry Tworek jokes that Nvidia's CUDA translates to miracles in Polish
NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI
Â
🤖
ML Systems
Â
Content type:
Blog
blogs.nvidia.com
·
7h
7 hours ago
Actions for NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI
NVIDIA’s New RTX Spark Superchip Changes Everything for On-the-Go 12K Video Editing and 3D Rendering
Â
🔀
Parallel Computing
canonrumors.com
·
2d
2 days ago
Actions for NVIDIA’s New RTX Spark Superchip Changes Everything for On-the-Go 12K Video Editing and 3D Rendering
New comment by bhvk08 in "Ask HN: Who wants to be hired? (June 2026)"
Â
🔀
Parallel Computing
Â
Content type:
Discussion
news.ycombinator.com
·
1w
1 week ago
·
Hacker News
Actions for New comment by bhvk08 in "Ask HN: Who wants to be hired? (June 2026)"
AMD's Lemonade SDK For Local AI Adds NVIDIA
CUDA
Support
Â
🔀
Parallel Computing
phoronix.com
·
7h
7 hours ago
Actions for AMD's Lemonade SDK For Local AI Adds NVIDIA CUDA Support
1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM
Â
🔀
Parallel Computing
smolhub.com
·
2d
2 days ago
·
r/LocalLLaMA
Actions for 1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM
DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200
Â
🤖
ML Systems
Â
Content type:
News
newsletter.semianalysis.com
·
1d
1 day ago
·
Hacker News
Actions for DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200
Nvidia RTX Spark: The $2,900 Floor Tells You Everything
Â
🔀
Parallel Computing
Â
Content type:
Blog
Â
Content type:
Discussion
tildalice.io
·
6d
6 days ago
Actions for Nvidia RTX Spark: The $2,900 Floor Tells You Everything
From
GPU
to Token: The 8-Layer Observability Stack for AI Infrastructure
Â
🔀
Parallel Computing
Â
Content type:
Blog
jimmysong.io
·
1d
1 day ago
Actions for From GPU to Token: The 8-Layer Observability Stack for AI Infrastructure
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help