Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
⚡ High Performance
latency, throughput, benchmarking, optimization, profiling
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
200269
posts in
35.2
ms
Benchmarking
Subquadratic
's latest model and
SSA
Kernel
📊
AI Performance Profiling
appen.com
·
10h
·
Hacker News
How Superhuman and Databricks built a
200K
QPS
inference platform together
🏗️
LLM Infrastructure
databricks.com
·
6d
Exploring
LLMs Speed
Benchmarks
🏗️
LLM Infrastructure
mlops.community
·
1d
Let's talk
benchmarking
📊
Benchmarking
spacetimedb.com
·
8h
·
r/rust
nviennot/core-to-core-latency
: Measures the latency between CPU
cores
⚙️
CPUs
github.com
·
2d
·
Hacker News
AMD
uProf
5.3: Profiling Tool Gets
DuckDB
, Faster Reports and More Zen Analysis
⚡
Performance Tools
igorslab.de
·
21h
AI
versus
Throughput
📊
AI Performance Profiling
michaelnygard.com
·
3d
MLCommons
Chakra
: Advancing Performance Benchmarking and Co-design using Standardized Execution Traces
⚡
Performance Mythology
arxiv.org
·
1d
Tenstorrent
Unveils Galaxy AI Platform Targeting Scale and
Efficiency
🌊
Streaming Systems
forbes.com
·
17h
·
Hacker News
Scaling PCIe Controllers for AI
Bandwidth
: A
Multistream
Architecture Analysis for 64 GT/s and 128 GT/s
🎮
GPU Microarchitecture
semiengineering.com
·
1d
What Breaks at
1M
AI
Requests
per Day?
📊
Model Serving Economics
digitalocean.com
·
3d
Jankmarking
: Janky
Benchmarking
📊
AI Performance Profiling
williamangel.net
·
6d
·
Hacker News
Non-Monotonic
Latency in Apple MPS Decoding: KV Cache Interactions and Execution
Regimes
🎯
Emulator Accuracy
arxiv.org
·
2d
FractalSortCPU
: Bandwidth-Efficient Compressed
Radix
Sort on CPU
📊
Columnar Databases
arxiv.org
·
2d
·
Hacker News
KV-RM
:
Regularizing
KV-Cache Movement for Static-Graph LLM Serving
🎯
Data Locality
arxiv.org
·
2d
Beyond
Static
Policies: Exploring Dynamic Policy
Selection
for Single-Thread Performance Optimization
⏱️
Runtime Performance Analysis
arxiv.org
·
6d
Enhancing Instruction
Prefetching
via Cache and
TLB
Management
🖥️
Hardware Architecture
arxiv.org
·
1d
CUDAHercules
:
Benchmarking
Hardware-Aware Expert-level CUDA Optimization for LLMs
🏗️
LLM Infrastructure
arxiv.org
·
2d
Latency
Analysis and Optimization of
Alpamayo
1 via Efficient Trajectory Generation
⚡
Interpreter Optimization
arxiv.org
·
2d
gym-invmgmt
: An Open Benchmarking Framework for Inventory Management Methods
🏆
LLM Benchmarking
arxiv.org
·
1d
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help