Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
CUDA
🎮 CUDA
Specific
GPU programming, NVIDIA, CUDA kernels, GPU optimization
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
43
posts in
6.5
ms
Microsoft Weekly: Surface Laptop Ultra, Windows 11 context menus, Build 2026 recap, and more
🏗️
AI Infra
neowin.net
·
5d
5 days ago
Actions for Microsoft Weekly: Surface Laptop Ultra, Windows 11 context menus, Build 2026 recap, and more
Vortex 3.0 Released As Full-Stack, Open-Source RISC-V
GPU
Now With 3D Pipeline
💻
GPU Computing
phoronix.com
·
2d
2 days ago
Actions for Vortex 3.0 Released As Full-Stack, Open-Source RISC-V GPU Now With 3D Pipeline
Less-relevant results
Vortex expands open RISC-V
graphics
💻
GPU Computing
jonpeddie.com
·
16h
16 hours ago
Actions for Vortex expands open RISC-V graphics
1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM
💻
GPU Computing
smolhub.com
·
3d
3 days ago
·
r/LocalLLaMA
Actions for 1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM
Edge AI deployment made easy for system integrators
🏗️
AI Infra
edn.com
·
1d
1 day ago
Actions for Edge AI deployment made easy for system integrators
Nvidia
's best
GPU
feature is hiding in VLC's settings, and you're probably missing it
💻
GPU Computing
xda-developers.com
·
4d
4 days ago
Actions for Nvidia's best GPU feature is hiding in VLC's settings, and you're probably missing it
Build a local voice agent with Red Hat OpenShift AI
🧠
LLMs
developers.redhat.com
·
3d
3 days ago
Actions for Build a local voice agent with Red Hat OpenShift AI
Density Field State Space Models: 1-Bit Distillation, Efficient Inference, and Knowledge Organization in Mamba-2
🏗️
AI Infra
Content type:
Academic
arxiv.org
·
1d
1 day ago
Actions for Density Field State Space Models: 1-Bit Distillation, Efficient Inference, and Knowledge Organization in Mamba-2
Five labs, five minds: building a multi-model finance drama on small models
🏗️
AI Infra
Content type:
Blog
huggingface.co
·
4d
4 days ago
Actions for Five labs, five minds: building a multi-model finance drama on small models
Nvidia
DGX Spark GB10 – AI Models and Guide with vLLM and Autonomous Script
🧠
LLMs
Content type:
Code
github.com
·
5d
5 days ago
·
Hacker News
Actions for Nvidia DGX Spark GB10 – AI Models and Guide with vLLM and Autonomous Script
This Is the Hidden ‘AI Tax’ That Founders Need to Budget For
💻
GPU Computing
entrepreneur.com
·
1d
1 day ago
Actions for This Is the Hidden ‘AI Tax’ That Founders Need to Budget For
Holding the FP8 Quality Ceiling at 8-Bit Weights and Activations: INT8 and GGUF Post-Training Quantization of Ideogram 4.0 for Consumer GPUs
💻
GPU Computing
Content type:
Academic
arxiv.org
·
9h
9 hours ago
Actions for Holding the FP8 Quality Ceiling at 8-Bit Weights and Activations: INT8 and GGUF Post-Training Quantization of Ideogram 4.0 for Consumer GPUs
NetX-lab/Frontier: Frontier: A Discrete-Event Simulator for Modern LLM Serving
🧠
LLMs
Content type:
Code
github.com
·
6h
6 hours ago
·
Hacker News
Actions for NetX-lab/Frontier: Frontier: A Discrete-Event Simulator for Modern LLM Serving
Gram
Newton-Schulz: A Fast, Hardware-Aware Newton-Schulz Algorithm for Muon
🐧
Operating Systems
Content type:
Blog
tridao.me
·
2d
2 days ago
·
Hacker News
Actions for Gram Newton-Schulz: A Fast, Hardware-Aware Newton-Schulz Algorithm for Muon
SET: Stream-Event-Triggered Scheduling for Efficient
CUDA
Graph
Pipelines
🏗️
AI Infra
Content type:
Academic
arxiv.org
·
6d
6 days ago
Actions for SET: Stream-Event-Triggered Scheduling for Efficient CUDA Graph Pipelines
Gated DeltaNet, From First Principles
💻
GPU Computing
Content type:
Blog
sankalp.bearblog.dev
·
1d
1 day ago
Actions for Gated DeltaNet, From First Principles
🥇Top AI Papers of the Week
🏗️
AI Infra
Content type:
News
nlp.elvissaravia.com
·
3d
3 days ago
Actions for 🥇Top AI Papers of the Week
sgl-project/sglang-omni: SGLang Omni: High-Performance Multi-Stage Pipeline Framework for Omni Models
💻
GPU Computing
Content type:
Code
github.com
·
1d
1 day ago
Actions for sgl-project/sglang-omni: SGLang Omni: High-Performance Multi-Stage Pipeline Framework for Omni Models
Beyond Per-Token Pricing: A Concurrency-Aware Methodology for LLM Infrastructure Cost Estimation
🧠
LLMs
Content type:
Academic
arxiv.org
·
9h
9 hours ago
Actions for Beyond Per-Token Pricing: A Concurrency-Aware Methodology for LLM Infrastructure Cost Estimation
Open source building blocks for
computational
design. Est. 2006
💻
GPU Computing
thi.ng
·
3d
3 days ago
·
Hacker News
Actions for Open source building blocks for computational design. Est. 2006
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help