Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
Performance Profiling
📊 Performance Profiling
Benchmarking, Flame Graphs, CPU Analysis, Memory Profiling
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
167
posts in
10.1
ms
We went multi-region then undid it
🌐
Distributed systems
Content type:
Blog
useautumn.com
·
2d
2 days ago
·
Hacker News
Actions for We went multi-region then undid it
tensorhq/suture-stream-repair:
Ultra-low-latency
reverse proxy that repairs truncated & malformed JSON in LLM streaming responses (OpenAI, Anthropic, Vertex AI, Bedrock) — fixes JSONDecodeError / serde_json EOF on truncated tool
calls
.
🧩
Microservices
Content type:
Code
github.com
·
6d
6 days ago
·
Hacker News
Actions for tensorhq/suture-stream-repair: Ultra-low-latency reverse proxy that repairs truncated & malformed JSON in LLM streaming responses (OpenAI, Anthropic, Vertex AI, Bedrock) — fixes JSONDecodeError / serde_json EOF on truncated tool calls.
AI Voice Agent Architecture: How Real-Time Conversational Systems Work
🌐
Distributed systems
faridfadaie.com
·
5h
5 hours ago
·
Hacker News
Actions for AI Voice Agent Architecture: How Real-Time Conversational Systems Work
Integrate on-device AI models into your app using Core AI - WWDC26 - Videos
💻
Programming languages
developer.apple.com
·
2d
2 days ago
·
Hacker News
Actions for Integrate on-device AI models into your app using Core AI - WWDC26 - Videos
Magenta RealTime 2: Open and Local Live Music Models
📮
Message Queues
magenta.withgoogle.com
·
6d
6 days ago
·
Hacker News
,
Hacker News
,
r/LocalLLaMA
Actions for Magenta RealTime 2: Open and Local Live Music Models
memory
OS for AI agents (ranks, compresses and evolves agents
memory
)
🛡️
Odin
thrindex.com
·
43m
43 minutes ago
·
Hacker News
Actions for memory OS for AI agents (ranks, compresses and evolves agents memory)
Why AI code optimization needs production-grounded
benchmarks
📊
Systems Monitoring
Content type:
Blog
datadoghq.com
·
2d
2 days ago
·
Hacker News
Actions for Why AI code optimization needs production-grounded benchmarks
How Agoda Scaled Its Feature Store 50X with ScyllaDB
💾
Storage Engines
hackernoon.com
·
6d
6 days ago
Actions for How Agoda Scaled Its Feature Store 50X with ScyllaDB
ashp15205/guardian-runtime: A
zero-latency
, local-first runtime firewall for LLMs. Intercept every prompt and response locally to stop data
leaks
and runaway token costs.
📊
Systems Monitoring
Content type:
Code
github.com
·
1d
1 day ago
·
Hacker News
Actions for ashp15205/guardian-runtime: A zero-latency, local-first runtime firewall for LLMs. Intercept every prompt and response locally to stop data leaks and runaway token costs.
How We Ditched Postgres for ClickHouse to Process 12 Billion Caches
Per
Day
🗃️
Database Internals
Content type:
Blog
momentic.ai
·
5d
5 days ago
·
Hacker News
Actions for How We Ditched Postgres for ClickHouse to Process 12 Billion Caches Per Day
Premature Optimization is Fun Sometimes
⚡
SIMD Optimization
invlpg.com
·
2d
2 days ago
·
Lobsters
,
Hacker News
Actions for Premature Optimization is Fun Sometimes
Looking Inside Chromium’s On-Device AI Stack
📦
Data Serialization
Content type:
Blog
island.io
·
4h
4 hours ago
·
Hacker News
Actions for Looking Inside Chromium’s On-Device AI Stack
The Return of Rigorous Full-System Timing Simulation
⚡
SIMD Optimization
sigarch.org
·
2d
2 days ago
·
Hacker News
Actions for The Return of Rigorous Full-System Timing Simulation
iSCSI vs. NVMe/TCP: The ultimate storage showdown for Red Hat OpenShift Virtualization
💾
Storage Engines
developers.redhat.com
·
6d
6 days ago
·
Hacker News
Actions for iSCSI vs. NVMe/TCP: The ultimate storage showdown for Red Hat OpenShift Virtualization
lbj96347/nemotron-3.5-asr-ios: On-device, offline speech recognition for iPhone/iPad using NVIDIA's Nemotron-3.5-ASR Streaming 0.6B (multilingual) via CoreML.SwiftUI app with
mic
capture + audio file import, RNN-Tdecoding, and live
benchmark
metrics (
latency
, RTF, memory).
🛡️
Odin
Content type:
Code
github.com
·
16h
16 hours ago
·
Hacker News
Actions for lbj96347/nemotron-3.5-asr-ios: On-device, offline speech recognition for iPhone/iPad using NVIDIA's Nemotron-3.5-ASR Streaming 0.6B (multilingual) via CoreML.SwiftUI app with mic capture + audio file import, RNN-Tdecoding, and live benchmark metrics (latency, RTF, memory).
Nex-N2-mini: A 35B Model Built for Autonomous Agents
🛡️
Odin
hackernoon.com
·
14h
14 hours ago
Actions for Nex-N2-mini: A 35B Model Built for Autonomous Agents
Show HN: A Highly Available Distributed Router for Global Realtime AI
🌐
Distributed systems
Content type:
Blog
cerebrium.ai
·
6d
6 days ago
·
Hacker News
Actions for Show HN: A Highly Available Distributed Router for Global Realtime AI
NexusOS v2.0 – A zero-dependency pipeline streaming server chaos to Parquet
🔥
DataFusion
huggingface.co
·
2d
2 days ago
·
Hacker News
Actions for NexusOS v2.0 – A zero-dependency pipeline streaming server chaos to Parquet
"North Mini Code"; open weights, 30B param, Canadian coding model
💻
Programming languages
Content type:
Blog
cohere.com
·
1d
1 day ago
·
Hacker News
Actions for "North Mini Code"; open weights, 30B param, Canadian coding model
Your Lambda isn't
leaking
memory
— your metrics are lying to you
🧠
Memory Management
Content type:
Blog
engineering.taktile.com
·
6d
6 days ago
·
Hacker News
Actions for Your Lambda isn't leaking memory — your metrics are lying to you
« Page 1
·
Page 3 »
Log in to enable infinite scrolling
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Save / unsave
s
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help