Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
⚡ GPU
cuda,triton
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
77
posts in
7.5
ms
Dissecting ThunderKittens: Anatomy of a Compact DSL for High-Performance AI
Kernels
🔱
Triton
hamzaelshafie.bearblog.dev
·
3d
·
Lobsters
Characterization of machine learning compilers for LLM inference on NVIDIA GPUs
📱
Edge AI
link.springer.com
·
23h
·
Hacker News
A Kubernetes operator for local LLMs across Nvidia and Mac fleets
🤖
ROS2
llmkube.com
·
1d
·
Hacker News
lupinemachines/lupine: LUPINE is a
GPU
over IP bridge allowing GPUs on remote machines to be attached to CPU-only machines.
🔱
Triton
github.com
·
12h
·
Hacker News
NVCF
Is Now Open Source: Inside NVIDIA's
GPU
Function Platform
🔱
Triton
blog.kubesimplify.com
·
2d
·
Hacker News
,
r/golang
,
r/programming
Ollama Doesn't Know Its
GPU
Is on Another Machine
🔱
Triton
loopholelabs.io
·
4d
·
Hacker News
WarpSpeed
approaches Speed of Light on Blackwell
🔱
Triton
doubleai.com
·
18h
·
Hacker News
‘
Corpse
Point’ In the Arctic Is Melting, Disturbing Centuries-Old Bodies
🌊
Ocean Sensing
404media.co
·
1d
·
Hacker News
KV Cache and Flash Attention with interactive diagrams
🔧
Hardware
kvcache.cobanov.dev
·
4d
·
Hacker News
First ever Cray T3D Supercomputer goes up for auction with $81,000 reserve — Europe’s fastest supercomputer in June 1996 goes on the block
🔧
Hardware
tomshardware.com
·
12h
·
Hacker News
The brain still needs the hammer: Why compilers matter MORE in the agent era, not less
🦀
Rust
scale-lang.com
·
5d
·
Hacker News
A
warp
drive with predominantly positive invariant energy density and
global
Hawking-Ellis Type I
🗺️
Motion Planning
arxiv.org
·
2d
·
Hacker News
Benchmarking llama.cpp's brand-new MTP support on Strix Halo
🔧
Hardware
calebcoffie.com
·
6d
·
Hacker News
Luce DFlash + PFlash on 7900XTX: Qwen3.6-27B at 2.24x decode and 3.05x prefill vs llama.cpp HIP
🔧
Hardware
lucebox.com
·
6d
·
r/LocalLLaMA
NVIDIA Removes Gaming Revenue Category From Financial Reports
🔱
Triton
guru3d.com
·
3d
·
Hacker News
,
r/LocalLLaMA
Crooked Forest
🎨
Neural Rendering
en.wikipedia.org
·
2d
·
Hacker News
Running PyTorch Models on Apple Silicon GPUs with the ExecuTorch MLX Delegate
📱
Edge AI
pytorch.org
·
6d
·
Hacker News
When Generation Becomes Cheap, Selection Becomes Governance
🎮
mujoco
lospino.so
·
13h
·
Hacker News
The End of a Craft?
🏭
Robotic Manufacturing
neuribs.substack.com
·
10h
·
Substack
GPU
Memory
Math for LLMs: Formula That Tells You What Fits on Your
GPU
🔱
Triton
theahmadosman.substack.com
·
4d
·
Substack
,
r/LocalLLaMA
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help