Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
⚡ Cuda
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
122390
posts in
1.48
s
AI in Multiple
GPUs
: Understanding the Host and Device
Paradigm
towardsdatascience.com
·
6h
⏱️
CUDA Events
datavorous/spheni
: An in-memory vector search library in C++ with Python bindings
github.com
·
1d
·
Discuss:
Hacker News
✂️
CUTLASS
GPU-Fuzz
: Finding Memory Errors in Deep Learning
Frameworks
arxiv.org
·
14h
🧮
cuDNN
Show HN: Solving
Sudoku
reasoning via Energy
Geometric
models
davisgeometric.com
·
10h
·
Discuss:
Hacker News
✂️
CUTLASS
Building a Zero-Dependency
secp256k1
CUDA
Engine from Scratch (2.5B ops/SEC)
github.com
·
1d
·
Discuss:
Hacker News
🔧
PTX
Implementing
3D Graphics
Basics
hackaday.com
·
16h
✂️
CUTLASS
BOute
: Cost-Efficient LLM Serving with Heterogeneous LLMs and GPUs via
Multi-Objective
Bayesian Optimization
arxiv.org
·
14h
🔗
NCCL
Microgpt.py
gist.github.com
·
21h
·
Discuss:
Hacker News
,
Hacker News
📉
Model Quantization
Two Ways to Move
Tensors
Without Stopping: Inside
vLLM
's Async GPU Transfer Patterns
dev.to
·
22h
·
Discuss:
DEV
🌊
CUDA Streams
NVIDIA
GeForce
NOW Turns
Screens
Into a Gaming Machine
elevenforum.com
·
4h
🎮
NVIDIA
A C implementation of the inference pipeline for the Mistral AI’s
Voxtral
Realtime
4B model
blog.adafruit.com
·
2h
🏎️
TensorRT
NVIDIA
DGX
Spark
Powers Big Projects in Higher Education
blogs.nvidia.com
·
4h
🔗
NCCL
Ming-flash-omni-2.0
: 100B MoE (6B active) omni-modal model - unified
speech/SFX/music
generation
huggingface.co
·
1h
·
Discuss:
r/LocalLLaMA
⚡
Flash Attention
The Efficiency Wall: Why the Next 1,000x
Leap
Isn’t More
GPUs
pub.towardsai.net
·
15h
🌊
CUDA Streams
EGPU
Enclosures
support
lemmy.ml
·
22h
📈
GPU Occupancy
Nvidia RTX
6000D
teardown
shows 84GB VRAM using 3GB memory chips
club386.com
·
2h
📈
GPU Occupancy
Porting an INT8 VHDL CNN from Intel
Agilex
3 to Lattice
Certus-NX
news.ycombinator.com
·
6h
·
Discuss:
Hacker News
🔧
PTX
What Agentic AI "Vibe Coding" In The Hands Of
Actual
Programmers
/ Engineers
stochasticlifestyle.com
·
7h
🔄
ONNX
CPU
cloth
simulation performance
comparable
to GPU SotA
sig25ddmpd.github.io
·
18h
·
Discuss:
Hacker News
✂️
CUTLASS
How
Programmers
Spend
Their Time
probablydance.com
·
1d
·
Discuss:
Hacker News
⚡
Flash Attention
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help