Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
GPU Computing
🖥️ GPU Computing
CUDA, GPU programming, parallel computing, GPGPU
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
184
posts in
7.1
ms
RightNow-AI/AutoMegaKernel: An agent harness that compiles a model into one provably-correct, self-retargeting
CUDA
megakernel and self-tunes it past
cuBLAS
at batch-1 LLM decode.
🤖
Machine Learning
Content type:
Code
github.com
·
2d
2 days ago
·
Hacker News
Actions for RightNow-AI/AutoMegaKernel: An agent harness that compiles a model into one provably-correct, self-retargeting CUDA megakernel and self-tunes it past cuBLAS at batch-1 LLM decode.
Framework Desktop AMD 395+ (rdna 3.5) cannot run confyui err Fix 2026
🤖
Machine Learning
Content type:
Blog
runaihome.com
·
3d
3 days ago
·
DEV
Actions for Framework Desktop AMD 395+ (rdna 3.5) cannot run confyui err Fix 2026
AMD's Lemonade SDK For Local AI Adds NVIDIA
CUDA
Support
💾
Flash Storage
phoronix.com
·
7h
7 hours ago
Actions for AMD's Lemonade SDK For Local AI Adds NVIDIA CUDA Support
AgentCompile: An LLM-Guided Compiler for Direct
CUDA
Inference
🤖
Machine Learning
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for AgentCompile: An LLM-Guided Compiler for Direct CUDA Inference
NVIDIA Nsight
Compute
💾
Flash Storage
developer.nvidia.com
·
6d
6 days ago
Actions for NVIDIA Nsight Compute
Exploiting
GPU
Tensor Cores from Java using Babylon [Juan Fumero]
📅
Scheduling Algorithms
openjdk.org
·
1d
1 day ago
·
r/java
Actions for Exploiting GPU Tensor Cores from Java using Babylon [Juan Fumero]
NVIDIA chip powers local AI workloads
🧠
Computational Neuroscience
edn.com
·
5h
5 hours ago
Actions for NVIDIA chip powers local AI workloads
Flatpak 1.18 adds AMD
ROCm
support, improved error output, and faster Fish shell start-up
💾
Flash Storage
alternativeto.net
·
1d
1 day ago
Actions for Flatpak 1.18 adds AMD ROCm support, improved error output, and faster Fish shell start-up
Full Context on a Vulkan-Only Strix Halo: The Decode-Drop Reproduces, but the Sweet Spot Moves
💾
Flash Storage
thefrontierlab.ai
·
6d
6 days ago
·
Hacker News
Actions for Full Context on a Vulkan-Only Strix Halo: The Decode-Drop Reproduces, but the Sweet Spot Moves
Core Automation
co-founder
Jerry Tworek jokes that Nvidia's
CUDA
translates to miracles in Polish
💾
Flash Storage
digg.com
·
6d
6 days ago
Actions for Core Automation co-founder Jerry Tworek jokes that Nvidia's CUDA translates to miracles in Polish
DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200
🤖
Machine Learning
Content type:
News
newsletter.semianalysis.com
·
1d
1 day ago
·
Hacker News
Actions for DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time - Huawei, GB300 NVL72, MI355X, B200
Location: Edmonton, Canada Remote: Yes Willing to relocate: Yes, within Canada T...
🗄️
Databases
Content type:
Discussion
news.ycombinator.com
·
3h
3 hours ago
·
Hacker News
Actions for Location: Edmonton, Canada Remote: Yes Willing to relocate: Yes, within Canada T...
Nvidia RTX Spark: The $2,900 Floor Tells You Everything
🤖
Machine Learning
Content type:
Blog
Content type:
Discussion
tildalice.io
·
6d
6 days ago
Actions for Nvidia RTX Spark: The $2,900 Floor Tells You Everything
NVIDIA’s New RTX Spark Superchip Changes Everything for On-the-Go 12K Video Editing and 3D Rendering
💾
Flash Storage
canonrumors.com
·
2d
2 days ago
Actions for NVIDIA’s New RTX Spark Superchip Changes Everything for On-the-Go 12K Video Editing and 3D Rendering
Nvidia GeForce RTX 50 Super GPUs may launch in early 2027 with 50% more VRAM
💾
Flash Storage
club386.com
·
1d
1 day ago
Actions for Nvidia GeForce RTX 50 Super GPUs may launch in early 2027 with 50% more VRAM
WSL 3 will finally let Linux apps use your
GPU
and NPU without the performance tax
🤖
Machine Learning
xda-developers.com
·
7h
7 hours ago
Actions for WSL 3 will finally let Linux apps use your GPU and NPU without the performance tax
Microsoft's Surface Laptop Ultra Announced! #shorts
💾
Flash Storage
Content type:
Video
youtube.com
·
6d
6 days ago
Actions for Microsoft's Surface Laptop Ultra Announced! #shorts
Building & Benchmarking: LLMs on a 16GB Jetson Orin NX for Hermes Agent
💾
Flash Storage
Content type:
Blog
dnhkng.github.io
·
2d
2 days ago
Actions for Building & Benchmarking: LLMs on a 16GB Jetson Orin NX for Hermes Agent
Release TorchCodec 0.14: HDR Video Decoding for CPU &
CUDA
, and Fast Wav Decoder · meta-pytorch/torchcodec
🤖
Machine Learning
Content type:
Code
github.com
·
10h
10 hours ago
·
Hacker News
Actions for Release TorchCodec 0.14: HDR Video Decoding for CPU & CUDA, and Fast Wav Decoder · meta-pytorch/torchcodec
1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM
🤖
Machine Learning
smolhub.com
·
2d
2 days ago
·
r/LocalLLaMA
Actions for 1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help