Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🚀 Performance
Broad
Benchmarking, Profiling, Optimization, Bottlenecks
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
21334
posts in
17.6
ms
ExaBench
: An Open Database Performance
Leaderboard
🧮
Vector Databases
exasol.com
·
1d
·
Hacker News
[
WIP
] Benchmarking Local LLMs Against Coding Agent
Harnesses
🦙
Ollama
neuralnoise.com
·
3d
·
Hacker News
Granite
4.1: IBM's
8B
Model Is Competing With Models Four Times Its Size
🦙
Ollama
firethering.com
·
21h
·
Hacker News
Utilyze
measures how
efficiently
your GPU is doing useful work
⚙
Laptop optimization
github.com
·
13h
·
Hacker News
TurboQuant
on a MacBook Pro, part 2: perplexity, KL
divergence
, and asymmetric K/V on M5 Max
⬛
Ditherpunk
llmkube.com
·
2d
·
r/LocalLLaMA
DeepSeek-V4 on Day 0: From Fast Inference to Verified
RL
with
SGLang
and Miles
🧮
Vector Databases
lmsys.org
·
5d
·
Hacker News
Openweight
Benchmark
🧮
Vector Databases
openweightbench.pages.dev
·
14h
·
Hacker News
KV
Cache
Locality
: The Hidden Variable in Your LLM Serving Cost
⚙
Laptop optimization
ranvier.systems
·
1d
·
Hacker News
Issue 649
💡
New and interesting problems
datascienceweekly.substack.com
·
10h
·
Substack
Odysseys
: Benchmarking Web Agents on
Realistic
Long Horizon Tasks
🦙
Ollama
odysseys-website.pages.dev
·
1d
·
Hacker News
Show HN:
Utilyze
, an open source GPU monitoring tool more accurate than
nvtop
⚙
Laptop optimization
systalyze.com
·
3d
·
Hacker News
PEAKS No 42: The Open-Weight Uprising: GPT-5.5, Qwen Beats a
397B
Giant, and Your
Jira
Data Is Now AI Training Fuel
🦙
Ollama
bogdandeac.com
·
2d
Vibing
, Harness and
OODA
loop
🦙
Ollama
architecture-weekly.com
·
4d
Show HN:
1990s
Game Dev
Algorithms
for Distributed Systems
🦙
Ollama
docs.merca.earth
·
2d
·
Hacker News
GPT-5.5:
Capabilities
and
Reactions
🦙
Ollama
thezvi.wordpress.com
·
2d
Introducing
SOB
: A Multi-Source
Structured
Output Benchmark for LLMs
🦙
Ollama
interfaze.ai
·
3d
·
Hacker News
Reaching
SOTA
Without Breaking the Bank: Using
AI21
Maestro to optimize deep research agents
🦙
Ollama
ai21.com
·
2d
·
Hacker News
Reimagining Kernel Generation at the
PTX
Layer: An LLM System Learning from
DSLs
to Outperform Them
🦙
Ollama
standardkernel.com
·
3d
·
Hacker News
Containerized
data centers help avoid many
pitfalls
in AI deployments
⌨️
Cyberdeck Building
techzine.eu
·
2d
AI
evals
are becoming the new compute
bottleneck
🦙
Ollama
huggingface.co
·
1d
·
Hacker News
Page 2 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help