Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
🎮 SIMT Execution
Specific
GPU Programming, Warp Divergence, Thread Blocks, CUDA Model
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
321
posts in
9.9
ms
CUDA
13.3: NVIDIA continues to move
GPU
programming
from the thread to the tile
🎨
WGPU
igorslab.de
·
5d
CUDA
Cores
vs.
Tensor
Cores
⚡
Hardware Acceleration
beam.cloud
·
1d
+12 years of
programming
, now what?
🎮
WebGPU
en.wikipedia.org
·
21h
·
r/programming
Nvidia Maxwell Architecture
🎮
WebGPU
developer.nvidia.com
·
14h
·
Hacker News
Caspar:
CUDA
Accelerator for Symbolic
Programming
with Adaptive Reordering
🌀
Naiad
arxiv.org
·
1d
llama.cpp B9387 Significant AMD/ROCm PP Update
🧮
MKL
github.com
·
4d
·
r/LocalLLaMA
When does fragmentation occur in the
CUDA
caching allocator?
🧱
Slab Allocation
docs.pytorch.org
·
11h
·
Hacker News
Nvidia ARM Laptop Chip N1X Confirmed for Computex:
CUDA
and RTX 5070
GPU
Onboard
⚡
Hardware Acceleration
techtimes.com
·
2d
NVIDIA
CUDA
13.3 Rolls Out
CUDA
Python 1.0,
CUDA
Tile For C++
⚡
Hardware Acceleration
lxer.com
·
5d
Nvidia's long-awaited N1/N1X SoC specs leak ahead of Computex launch — N1 to feature up to 20 Arm-based
cores
, standard N1 equipped with 12- and 10-core configs
⚡
Hardware Acceleration
tomshardware.com
·
1d
ROS2 vs Isaac ROS: 8x Perception Speedup with NITROS
⚡
LMAX Disruptor
tildalice.io
·
3d
openclaw/clawpatch v0.5.0
⚡
Ruff
github.com
·
1d
RAFI -- A Ray/Work Forwarding Infrastructure for Data Parallel
Multi-Node/Multi-GPU
Computing
🚀
Milvus
arxiv.org
·
4d
avencera/speakrs: Speaker diarization in Rust. 312–912x realtime on Apple Silicon, 50–121x on
CUDA
. Matches pyannote accuracy.
🍱
Nom
github.com
·
6d
·
Hacker News
,
r/rust
Towards Feedback-to-Plan Decisions for Self-Evolving LLM Agents in
CUDA
Kernel
Generation
💬
Prompt Engineering
arxiv.org
·
6d
jmaczan/tiny-vllm: Build your own high performance LLM inference engine in C++ and
CUDA
- a smaller version of vLLM
🦙
Ollama
github.com
·
3d
·
Hacker News
TC-MIS: Maximal Independent Set on
Tensor-cores
🕸️
GraphBLAS
arxiv.org
·
4d
jndean/gpusnek:
GPU-Parallelizing
Arbitrary Python Code By Running 1 Million Python Interpreters on a
GPU
🐍
🎮
WebGPU
github.com
·
5d
·
Hacker News
zayokami/Talos-XII: A deep learning framework based on the gacha mechanics of Arknights: Endfield. 以《明日方舟:终末地》的抽卡学习为基准的深度学习框架。
⚙️
XLA
github.com
·
3d
·
r/rust
NVIDIA
CUDA
13.3 Rolls Out
CUDA
Python 1.0,
CUDA
Tile For C++
⚡
Hardware Acceleration
phoronix.com
·
5d
·
Hacker News
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help