Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Performance Engineering
⚡ Performance Engineering
Profiling, Benchmarking, Optimization Techniques, Cache Efficiency
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
29
posts in
13.7
ms
Defeat the Heap:
Zero-Copy
Data Movement in AXI4MLIR
🔄
Compiler Design
Content type:
Academic
arxiv.org
·
20h
20 hours ago
·
Hacker News
Actions for Defeat the Heap: Zero-Copy Data Movement in AXI4MLIR
ModPageSpeed 2.0: Lighthouse 56 to 90. On your own servers
🚢
DevOps
Content type:
Discussion
modpagespeed.com
·
3d
3 days ago
·
Hacker News
Actions for ModPageSpeed 2.0: Lighthouse 56 to 90. On your own servers
Apple WWDC On-Device AI Deep Dive - Google Docs
🔄
Compiler Design
gist.is
·
2h
2 hours ago
·
Hacker News
Actions for Apple WWDC On-Device AI Deep Dive - Google Docs
Two Leaps to 1000 Tokens/s on a 1T-Parameter Model: On Inference Systems, Execution Boundaries, and
Co-Design
🧠
Computer Architecture
Content type:
Blog
tilert.ai
·
2d
2 days ago
·
Hacker News
Actions for Two Leaps to 1000 Tokens/s on a 1T-Parameter Model: On Inference Systems, Execution Boundaries, and Co-Design
HFT
Latency
Monitoring with Probabilistic Calling
Context
👁️
Observability
hftuniversity.com
·
1d
1 day ago
·
Hacker News
Actions for HFT Latency Monitoring with Probabilistic Calling Context
harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.
🧠
Computer Architecture
Content type:
Code
github.com
·
4d
4 days ago
·
Hacker News
Actions for harshuljain13/llm-inference-at-scale: A Practitioner handbook for production llm serving.
ClaudeHeads
🔄
Compiler Design
Content type:
Blog
fknil.pages.dev
·
1d
1 day ago
·
Lobsters
Actions for ClaudeHeads
Apples to Apples: MLX vs. Llama.cpp for Gemma 4 12B on an M1 16GB
🧠
Computer Architecture
Content type:
Blog
ziraph.com
·
5d
5 days ago
·
Hacker News
Actions for Apples to Apples: MLX vs. Llama.cpp for Gemma 4 12B on an M1 16GB
DiffusionGemma: 4x Faster Text Generation
🧠
Computer Architecture
Content type:
News
Content type:
Blog
blog.google
·
8h
8 hours ago
·
Hacker News
,
r/LocalLLaMA
,
r/singularity
Actions for DiffusionGemma: 4x Faster Text Generation
The
perils
of UUID primary keys in SQLite
🗄
Database Systems
andersmurphy.com
·
6d
6 days ago
·
Lobsters
,
Hacker News
,
r/programming
Actions for The perils of UUID primary keys in SQLite
Now available: Amazon EC2 M9g and M9gd instances powered by new AWS Graviton5 processors
✅
Formal Verification
Content type:
Blog
aws.amazon.com
·
9h
9 hours ago
·
Hacker News
Actions for Now available: Amazon EC2 M9g and M9gd instances powered by new AWS Graviton5 processors
A cute little trick to running classic IIR filters on the GPU
🧠
Computer Architecture
Content type:
Blog
themaister.net
·
2d
2 days ago
·
Hacker News
Actions for A cute little trick to running classic IIR filters on the GPU
Global
memory
shortage throws wrench into IT
pros
’ budgets, planning
🧠
Computer Architecture
Content type:
News
itbrew.com
·
2d
2 days ago
·
Hacker News
Actions for Global memory shortage throws wrench into IT pros’ budgets, planning
Launch HN: General Instinct (YC P26) – Frontier models on edge devices
⚙️
Engineering
Content type:
Discussion
news.ycombinator.com
·
5d
5 days ago
·
Hacker News
Actions for Launch HN: General Instinct (YC P26) – Frontier models on edge devices
aussiealex/agentmeter: Know what your agents cost. Cost intelligence for AI coding agents.
🌐
HTMX
Content type:
Code
github.com
·
11h
11 hours ago
·
Hacker News
Actions for aussiealex/agentmeter: Know what your agents cost. Cost intelligence for AI coding agents.
On-device AI is a margin decision
🧠
Computer Architecture
Content type:
Blog
ziraph.com
·
6h
6 hours ago
·
Hacker News
Actions for On-device AI is a margin decision
The Road to Component Model 1.0
📦
WebAssembly
bytecodealliance.org
·
3d
3 days ago
·
Hacker News
Actions for The Road to Component Model 1.0
Full
Context
on a Vulkan-Only Strix Halo: The Decode-Drop Reproduces, but the Sweet Spot Moves
🧠
Computer Architecture
thefrontierlab.ai
·
6d
6 days ago
·
Hacker News
Actions for Full Context on a Vulkan-Only Strix Halo: The Decode-Drop Reproduces, but the Sweet Spot Moves
How We Ditched Postgres for ClickHouse to Process 12 Billion
Caches
Per
Day
🗄
Database Systems
Content type:
Blog
momentic.ai
·
6d
6 days ago
·
Hacker News
Actions for How We Ditched Postgres for ClickHouse to Process 12 Billion Caches Per Day
The economics of speculative decoding
🧠
Computer Architecture
Content type:
Blog
fergusfinn.com
·
3d
3 days ago
·
Hacker News
Actions for The economics of speculative decoding
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help