Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
CUDA
🟢 CUDA
Specific
CUDA kernels, NVIDIA GPU programming, PTX, cuBLAS
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
163
posts in
9.4
ms
Exploiting
GPU
Tensor
Cores
from Java using Babylon [Juan Fumero]
🎮
GPU Computing
openjdk.org
·
1d
1 day ago
·
r/java
Actions for Exploiting GPU Tensor Cores from Java using Babylon [Juan Fumero]
CUDA-Oxide
0.2 Brings Early Improvements To Pure Rust
CUDA
Kernels
🎮
GPU Computing
phoronix.com
·
5d
5 days ago
Actions for CUDA-Oxide 0.2 Brings Early Improvements To Pure Rust CUDA Kernels
Exploiting
GPU
Tensor
Cores
from Java using Babylon
⚡
Triton
inside.java
·
19h
19 hours ago
Actions for Exploiting GPU Tensor Cores from Java using Babylon
RightNow-AI/AutoMegaKernel: An agent harness that compiles a model into one
provably-correct
, self-retargeting
CUDA
megakernel and self-tunes it past
cuBLAS
at batch-1 LLM decode.
🔢
GEMM Optimization
Content type:
Code
github.com
·
2d
2 days ago
·
Hacker News
Actions for RightNow-AI/AutoMegaKernel: An agent harness that compiles a model into one provably-correct, self-retargeting CUDA megakernel and self-tunes it past cuBLAS at batch-1 LLM decode.
Density Field State Space Models: 1-Bit Distillation, Efficient Inference, and Knowledge Organization in Mamba-2
🎮
GPU Computing
Content type:
Academic
arxiv.org
·
15h
15 hours ago
Actions for Density Field State Space Models: 1-Bit Distillation, Efficient Inference, and Knowledge Organization in Mamba-2
NVIDIA
Nsight Compute
🎮
GPU Computing
developer.nvidia.com
·
6d
6 days ago
Actions for NVIDIA Nsight Compute
NVIDIA
Confidential Computing to Help Expand Apple’s Private Cloud Compute
🎮
GPU Computing
Content type:
Blog
blogs.nvidia.com
·
20h
20 hours ago
Actions for NVIDIA Confidential Computing to Help Expand Apple’s Private Cloud Compute
Ollama 0.30 delivers faster
NVIDIA
GPU
performance and wider hardware support
🧠
Inference Engineering
alternativeto.net
·
2d
2 days ago
Actions for Ollama 0.30 delivers faster NVIDIA GPU performance and wider hardware support
Less-relevant results
Apple expands Private Cloud Compute to Google Cloud and
NVIDIA
hardware
⚗️
Kernel Fusion
4sysops.com
·
5h
5 hours ago
Actions for Apple expands Private Cloud Compute to Google Cloud and NVIDIA hardware
Nvidia
RTX Spark: The $2,900 Floor Tells You Everything
🎮
GPU Computing
Content type:
Blog
Content type:
Discussion
tildalice.io
·
6d
6 days ago
Actions for Nvidia RTX Spark: The $2,900 Floor Tells You Everything
Google Pays SpaceX $920M/Month for AI Compute (4 minute read)
💰
Inference Cost
winbuzzer.com
·
2d
2 days ago
Actions for Google Pays SpaceX $920M/Month for AI Compute (4 minute read)
NVIDIA
chip powers local AI workloads
🎮
GPU Computing
edn.com
·
50m
50 minutes ago
Actions for NVIDIA chip powers local AI workloads
Google-SpaceX $30B Compute Deal Raises Cloud Buyer Questions
☁️
Cloud Infrastructure
techrepublic.com
·
2d
2 days ago
Actions for Google-SpaceX $30B Compute Deal Raises Cloud Buyer Questions
how to make brave use
nvidia
gpu
on ubuntu?
⚗️
Kernel Fusion
lemmy.ml
·
13h
13 hours ago
Actions for how to make brave use nvidia gpu on ubuntu?
From
GPU
to Token: The 8-Layer Observability Stack for AI Infrastructure
💰
Inference Cost
Content type:
Blog
jimmysong.io
·
1d
1 day ago
Actions for From GPU to Token: The 8-Layer Observability Stack for AI Infrastructure
KJLdefeated/RL.cu
: RLVR training for LLM in CUDA/C++
💾
KV Cache
Content type:
Code
github.com
·
3d
3 days ago
·
Hacker News
Actions for KJLdefeated/RL.cu: RLVR training for LLM in CUDA/C++
AutoMegaKernel: A Statically-Checked Agent Harness for Self-Retargeting Megakernel Synthesis
🎮
GPU Computing
Content type:
Academic
arxiv.org
·
1d
1 day ago
·
Hacker News
Actions for AutoMegaKernel: A Statically-Checked Agent Harness for Self-Retargeting Megakernel Synthesis
Particle: SpaceX Discloses $920 Million‑a‑Month Google Compute Deal Ahead of IPO
⚗️
Kernel Fusion
Content type:
News
particle.news
·
5d
5 days ago
Actions for Particle: SpaceX Discloses $920 Million‑a‑Month Google Compute Deal Ahead of IPO
Apple extends Private Cloud Compute to third-party data centers
☁️
Cloud Infrastructure
helpnetsecurity.com
·
9h
9 hours ago
Actions for Apple extends Private Cloud Compute to third-party data centers
1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM
⏱️
Prefill Decoding
smolhub.com
·
2d
2 days ago
·
r/LocalLLaMA
Actions for 1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help