🚀 Performance
Broad
Benchmarking, Profiling, Optimization, Bottlenecks
Scoured 186407 posts in 19.1 ms
ExaBench: An Open Database Performance Leaderboard
🧮 Vector Databases · exasol.com · 1d · Hacker News
Fourth Data Prefetching Championship: Part I
⚙ Laptop optimization · sigarch.org · 3d
RuC: HDL-Agnostic Rule Completion Benchmark Generation
🔗 RAG · arxiv.org · 7h
Optimization vs. Architecture: Knowing the Difference
🧮 Vector Databases · tigerdata.com · 2d
MauroCE/m3serve: Optimised BAAI/bge-m3 serving with dense + sparse + ColBERT embeddings, async dynamic batching and pipeline GPU inference
🧮 Vector Databases · github.com · 4d · r/SideProject
atomic_queue benchmarks SMT vs no-SMT performance
⬛ Ditherpunk · max0x7ba.github.io · 2d · r/cpp, r/linux
Announcing Arm Performix: Empowering developers with scalable performance for the age of AI agents
🦙 Ollama · newsroom.arm.com · 2d · Hacker News
[WIP] Benchmarking Local LLMs Against Coding Agent Harnesses
🦙 Ollama · neuralnoise.com · 3d · Hacker News
TurboQuant on a MacBook Pro, part 2: perplexity, KL divergence, and asymmetric K/V on M5 Max
⬛ Ditherpunk · llmkube.com · 2d · r/LocalLLaMA
DeepSeek-V4 on Day 0: From Fast Inference to Verified RL with SGLang and Miles
🧮 Vector Databases · lmsys.org · 5d · Hacker News
How we built the most performant DeepSeek V3.2, MiniMax-M2.5 and Qwen 3.5 397B on DigitalOcean NVIDIA HGX™ B300 GPU Droplets
🦙 Ollama · digitalocean.com · 3d
Show HN: Utilyze, an open source GPU monitoring tool more accurate than nvtop
⚙ Laptop optimization · systalyze.com · 3d · Hacker News
Vibing, Harness and OODA loop
🦙 Ollama · architecture-weekly.com · 4d
What 2x GH200 delivers: memory paths for LLM inference
💫 slick production values · dnhkng.github.io · 6d
Introducing SOB: A Multi-Source Structured Output Benchmark for LLMs
🦙 Ollama · interfaze.ai · 3d · Hacker News
FineState-Bench: Benchmarking State-Conditioned Grounding for Fine-grained GUI State Setting
⬛ Ditherpunk · arxiv.org · 7h
Lambda Calculus Benchmark for AI
🦙 Ollama · victortaelin.github.io · 6d · Hacker News
70x faster cold(ish) starts for SGLang
💫 slick production values · fergusfinn.com · 6d · Hacker News
Reimagining Kernel Generation at the PTX Layer: An LLM System Learning from DSLs to Outperform Them
🦙 Ollama · standardkernel.com · 3d · Hacker News
Optimize Anything with LLMs
🦙 Ollama · gepa-ai.github.io · 6d