Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
๐ข cuBLAS
CUDA Linear Algebra, Matrix Operations, GPU BLAS, cuBLASLt
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
80275
posts in
621.9
ms
Prism
:
Spectral-Aware
Block-Sparse Attention
arxiv.org
ยท
8h
๐๏ธ
Attention Optimization
Same Engine, Multiple Gears: Parallelizing
Fixpoint
Iteration at Different
Granularities
(Extended Version)
arxiv.org
ยท
1d
๐
CUDA Streams
Show HN:
AetherLang
โ A
DSL
for building AI workflows with visual debugging
github.com
ยท
11h
ยท
Discuss:
Hacker News
๐
ONNX
Local-First AI: How
SLMs
are Fixing the
Latency
Gap ๐ปโจ
dev.to
ยท
1d
ยท
Discuss:
DEV
โก
Flash Attention
hsutter/cppfront
: A personal experimental C++ Syntax 2 -> Syntax 1 compiler
github.com
ยท
1d
๐
Compiler Optimization
Show HN:
LocalGPT
โ A local-first AI assistant in Rust with
persistent
memory
dev.to
ยท
1d
ยท
Discuss:
DEV
๐ก
LSP
Matching
the right LLM for your GPU feels like an art, but I finally
cracked
it
xda-developers.com
ยท
2d
๐
GPU Occupancy
Comparing
accumulate
to C++
23s
fold_left
meetingcpp.com
ยท
2d
๐
Compiler Optimization
C++
Implementing
a Chaos Game
simulator
solarianprogrammer.com
ยท
2d
โ๏ธ
CUTLASS
An
attempt
at a
First-Proof
AI challenge
abhvio.us
ยท
2d
ยท
Discuss:
Hacker News
๐
Kernel Fusion
Weeknotes
2026-W06
โบ Project
Pterodactyl
: incremental architecture
jonmsterling.com
ยท
2d
ยท
Discuss:
Hacker News
,
r/Compilers
๐
Ruff
Revealing effects of the local dimension on a variable-range interacting model by connecting
Lieb-Robinson
bounds and
multipartite
entanglement
link.aps.org
ยท
3d
๐
Kernel Fusion
Anthropics
Compiler
Challenge
corsix.org
ยท
2d
๐
Compiler Optimization
A generic reference defined by consensus peaks for single-cell
ATAC-seq
data analysis
nature.com
ยท
23h
๐๏ธ
TensorRT
Benchmarking
Malloc
with Doom 3
forrestthewoods.com
ยท
2d
๐
Profiling Tools
๐ฒ
GFX
luetkemj.github.io
ยท
3d
๐ฎ
NVIDIA
Fast
Autoscheduling
for Sparse ML
Frameworks
ajroot.pl
ยท
5d
ยท
Discuss:
Hacker News
,
r/Compilers
๐ฏ
Tensor Cores
Modern
Trends
In
Floating-Point
semiengineering.com
ยท
5d
๐ฏ
Tensor Cores
Understanding How GIL Affects
Checkpoint
Performance in
PyTorch
Training
shayon.dev
ยท
3d
ยท
Discuss:
Hacker News
โฑ๏ธ
CUDA Events
Getting started with
GSL
-
GNU
Scientific Library on Windows, macOS and Linux
solarianprogrammer.com
ยท
2d
๐ฆ
uv
Loading...
Loading more...
« Page 4
โข
Page 6 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help