Scour
🎮 CUDA
Specific
CUDA programming, GPU kernel, NVIDIA, parallel GPU
Scoured 169 posts in 9.7 ms
RightNow-AI/AutoMegaKernel: An agent harness that compiles a model into one provably-correct, self-retargeting CUDA megakernel and self-tunes it past cuBLAS at batch-1 LLM decode.
⚡ Parallel Computing
Content type: Code
github.com · 2 days ago · Hacker News
CUDA-Oxide 0.2 Brings Early Improvements To Pure Rust CUDA Kernels
🖼️ GPU Computing
phoronix.com · 5 days ago
Framework Desktop AMD 395+ (rdna 3.5) cannot run confyui err Fix 2026
🖼️ GPU Computing
Content type: Blog
runaihome.com · 2 days ago · DEV
Less-relevant results
Density Field State Space Models: 1-Bit Distillation, Efficient Inference, and Knowledge Organization in Mamba-2
⚡ Parallel Computing
Content type: Academic
arxiv.org · 14 hours ago
NVIDIA Nsight Compute
🖼️ GPU Computing
developer.nvidia.com · 6 days ago
Exploiting GPU Tensor Cores from Java using Babylon [Juan Fumero]
⚡ Parallel Computing
openjdk.org · 1 day ago · r/java
Microsoft's Surface Laptop Ultra Announced! #shorts
🖼️ GPU Computing
Content type: Video
youtube.com · 6 days ago
Nvidia GeForce RTX 50 Super GPUs may launch in early 2027 with 50% more VRAM
🖼️ GPU Computing
club386.com · 1 day ago
NVIDIA's RTX 5060 May Finally Get The VRAM Upgrade Gamers Wanted
🖼️ GPU Computing
Content type: News
hothardware.com · 5 days ago
1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM
⚡ Parallel Computing
smolhub.com · 2 days ago · r/LocalLLaMA
New comment by bhvk08 in "Ask HN: Who wants to be hired? (June 2026)"
⚡ Parallel Computing
Content type: Discussion
news.ycombinator.com · 6 days ago · Hacker News
NVIDIA’s New RTX Spark Superchip Changes Everything for On-the-Go 12K Video Editing and 3D Rendering
🖼️ GPU Computing
canonrumors.com · 2 days ago
AMD's Lemonade SDK For Local AI Adds NVIDIA CUDA Support
🖼️ GPU Computing
phoronix.com · 1 hour ago
Core Automation co-founder Jerry Tworek jokes that Nvidia's CUDA translates to miracles in Polish
🖼️ GPU Computing
digg.com · 6 days ago
Release TorchCodec 0.14: HDR Video Decoding for CPU & CUDA, and Fast Wav Decoder · meta-pytorch/torchcodec
🖼️ GPU Computing
Content type: Code
github.com · 4 hours ago · Hacker News
Building & Benchmarking: LLMs on a 16GB Jetson Orin NX for Hermes Agent
⚡ Parallel Computing
Content type: Blog
dnhkng.github.io · 1 day ago
Nvidia RTX Spark: The $2,900 Floor Tells You Everything
🖼️ GPU Computing
Content type: Blog, Discussion
tildalice.io · 6 days ago
APEX4: Efficient Pure W4A4 LLM Inference via Intra-SM Compute Rebalancing
⚡ Parallel Computing
Content type: Academic
arxiv.org · 1 day ago
Unreleased RTX 3050 Ti graphics card spotted in the wild, GA106 GPU with 6GB VRAM
🖼️ GPU Computing
Content type: News
tweaktown.com · 3 days ago
DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200
⚡ Parallel Computing
Content type: News
newsletter.semianalysis.com · 1 day ago · Hacker News