Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
CUDA
⚡ CUDA
Specific
GPU Programming, Kernel Optimization, Parallel Computing, NVIDIA
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
279
posts in
8.4
ms
Exploiting
GPU
Tensor
Cores
from Java using Babylon [Juan Fumero]
🔺
Triton
openjdk.org
·
2d
2 days ago
·
Lobsters
,
r/java
Actions for Exploiting GPU Tensor Cores from Java using Babylon [Juan Fumero]
CUDA-Oxide
0.2 Brings Early Improvements To Pure Rust
CUDA
Kernels
🔺
Triton
phoronix.com
·
5d
5 days ago
Actions for CUDA-Oxide 0.2 Brings Early Improvements To Pure Rust CUDA Kernels
GPUsnek is Python on
nVidia
’s
CUDA
🔺
Triton
Content type:
Blog
blog.adafruit.com
·
16h
16 hours ago
Actions for GPUsnek is Python on nVidia’s CUDA
WarpGuard
: Protected-Site Control-Flow Integrity for
CUDA
SASS Binaries
🔺
Triton
Content type:
Academic
arxiv.org
·
8h
8 hours ago
Actions for WarpGuard: Protected-Site Control-Flow Integrity for CUDA SASS Binaries
RightNow-AI/AutoMegaKernel: An agent harness that compiles a model into one provably-correct, self-retargeting
CUDA
megakernel and self-tunes it past
cuBLAS
at batch-1 LLM decode.
💾
KV Cache
Content type:
Code
github.com
·
2d
2 days ago
·
Hacker News
Actions for RightNow-AI/AutoMegaKernel: An agent harness that compiles a model into one provably-correct, self-retargeting CUDA megakernel and self-tunes it past cuBLAS at batch-1 LLM decode.
Framework Desktop AMD 395+ (rdna 3.5) cannot run confyui err Fix 2026
🔺
Triton
Content type:
Blog
runaihome.com
·
3d
3 days ago
·
DEV
Actions for Framework Desktop AMD 395+ (rdna 3.5) cannot run confyui err Fix 2026
Exploiting
GPU
Tensor
Cores
from Java using Babylon
🔺
Triton
inside.java
·
1d
1 day ago
Actions for Exploiting GPU Tensor Cores from Java using Babylon
Less-relevant results
NVIDIA
Accelerates Google DeepMind’s DiffusionGemma for Local AI
🔄
Transformers
Content type:
Blog
blogs.nvidia.com
·
19h
19 hours ago
Actions for NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI
Proton Experimental gets fixes for Path of Exile 1 & 2, Guild
Wars
2, Call of Duty (2003), Exanima and more
🔺
Triton
Content type:
News
gamingonlinux.com
·
3h
3 hours ago
·
r/SteamDeck
,
r/linux_gaming
Actions for Proton Experimental gets fixes for Path of Exile 1 & 2, Guild Wars 2, Call of Duty (2003), Exanima and more
Core
Automation co-founder Jerry Tworek jokes that
Nvidia
's
CUDA
translates to miracles in Polish
🔺
Triton
digg.com
·
6d
6 days ago
Actions for Core Automation co-founder Jerry Tworek jokes that Nvidia's CUDA translates to miracles in Polish
Building & Benchmarking: LLMs on a 16GB Jetson Orin NX for Hermes Agent
⚡
Inference Optimization
Content type:
Blog
dnhkng.github.io
·
2d
2 days ago
Actions for Building & Benchmarking: LLMs on a 16GB Jetson Orin NX for Hermes Agent
Apple expands Private Cloud
Compute
to Google Cloud and
NVIDIA
hardware
🔲
TPU Architecture
4sysops.com
·
22h
22 hours ago
Actions for Apple expands Private Cloud Compute to Google Cloud and NVIDIA hardware
From
GPU
to Token: The 8-Layer Observability Stack for AI Infrastructure
⚡
Inference Optimization
Content type:
Blog
jimmysong.io
·
2d
2 days ago
Actions for From GPU to Token: The 8-Layer Observability Stack for AI Infrastructure
NVIDIA
chip powers local AI workloads
🤖
agentic system
edn.com
·
17h
17 hours ago
Actions for NVIDIA chip powers local AI workloads
Nvidia
RTX Spark: The $2,900 Floor Tells You Everything
🤖
agentic system
Content type:
Blog
Content type:
Discussion
tildalice.io
·
6d
6 days ago
Actions for Nvidia RTX Spark: The $2,900 Floor Tells You Everything
Microsoft might be trimming AI excesses, but make no mistake — it's bringing AI features to more Windows 11 PCs, as a new initiative clearly shows
🤖
agentic system
Content type:
News
techradar.com
·
31m
31 minutes ago
Actions for Microsoft might be trimming AI excesses, but make no mistake — it's bringing AI features to more Windows 11 PCs, as a new initiative clearly shows
AMD's Lemonade SDK For Local AI Adds
NVIDIA
CUDA
Support
💾
KV Cache
phoronix.com
·
19h
19 hours ago
·
r/artificial
Actions for AMD's Lemonade SDK For Local AI Adds NVIDIA CUDA Support
Train Models Faster with JAX and MaxText Using NVFP4 on
NVIDIA
Blackwell
⚡
Inference Optimization
Content type:
News
Content type:
Blog
developer.nvidia.com
·
2d
2 days ago
Actions for Train Models Faster with JAX and MaxText Using NVFP4 on NVIDIA Blackwell
NVIDIA
's RTX 5060 May Finally Get The VRAM Upgrade Gamers Wanted
🔺
Triton
Content type:
News
hothardware.com
·
5d
5 days ago
Actions for NVIDIA's RTX 5060 May Finally Get The VRAM Upgrade Gamers Wanted
Ollama 0.30
GPU
Boost: Faster local Qwen inference on
NVIDIA
🔺
Triton
everylocalai.com
·
15h
15 hours ago
·
DEV
Actions for Ollama 0.30 GPU Boost: Faster local Qwen inference on NVIDIA
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help