⚡ GPU
cuda,triton
Luce Megakernel: Why is nobody talking about this?
🔱 Triton · github.com · 5d · r/LocalLLaMA
Ollama Doesn't Know Its GPU Is on Another Machine
🔱 Triton · loopholelabs.io · 1d · Hacker News
GPU Memory Math for LLMs: Formula That Tells You What Fits on Your GPU
🔱 Triton · theahmadosman.substack.com · 23h · Substack, r/LocalLLaMA
FOSS Weekly #26.21: Microsoft's Distro, Bitwarden Drama, Adobe on Linux, New Email Client and More
🔧 Hardware · itsfoss.com · 5h · Hacker News
KV Cache and Flash Attention with interactive diagrams
🔧 Hardware · kvcache.cobanov.dev · 1d · Hacker News
Show HN: FlashAttention-2 in Cute, from Scratch
🔱 Triton · blog.echen.io · 3d · Hacker News
Nvidia unveils its diffusion language model, "Nemotron-Labs-Diffusion"
🤖 llm · huggingface.co · 10h · Hacker News
The brain still needs the hammer: Why compilers matter MORE in the agent era, not less
🦀 Rust · scale-lang.com · 2d · Hacker News
Nvidia Announces Financial Results for First Quarter Fiscal 2027
🔧 Hardware · nvidianews.nvidia.com · 23h · Hacker News, r/wallstreetbets
Running PyTorch Models on Apple Silicon GPUs with the ExecuTorch MLX Delegate
📱 Edge AI · pytorch.org · 2d · Hacker News
The Ultimate LLM Fine-Tuning Guide
🤖 llm · promptinjection.net · 4d · Hacker News
Wes McKinney releases multiple bangers
📱 Edge AI · kenn.io · 3h · Hacker News
We made our filesystem 47× faster by deleting it
🦀 Rust · microsandbox.dev · 2d · Hacker News
Benchmarking llama.cpp's brand-new MTP support on Strix Halo
🔧 Hardware · calebcoffie.com · 3d · Hacker News
DashAttention: Differentiable and Adaptive Sparse Hierarchical Attention
👁️ Computer vision · arxiv.org · 2d · Hacker News
Luce DFlash + PFlash on 7900XTX: Qwen3.6-27B at 2.24x decode and 3.05x prefill vs llama.cpp HIP
🔧 Hardware · lucebox.com · 3d · r/LocalLLaMA
OpenBSD 7.9 Released
🦀 Rust · openbsd.org · 2d · Lobsters, Hacker News, r/linux, r/opensource
Is Huawei Too Slow on AI?
📱 Edge AI · developer.huawei.com · 7h · Hacker News
China bypasses US GPU bans with 1.54-exaflops 'LineShine' supercomputer: CPU-only monster packs 2.4 million Huawei-designed Armv9 cores
🔧 Hardware · tomshardware.com · 4d · Hacker News, r/hardware
Architecture & Systems are Changing: The Architect's Role in the Era of Agentic Co-Design
🔌 FPGAs · sigarch.org · 2d · Hacker News