Tail Latency

Fairness-Aware and Latency-Controllable Scheduling for Chunked-Prefill LLM Serving

 🔄Cache Replacement  Content type: Academic
arxiv.org·

Tuning SCHED_BATCH for Non-Interactive, CPU-Bound Workloads

 🗓️Scheduling  Content type: News  Content type: Blog

HFT Latency Monitoring with Probabilistic Calling Context

 📊Profiling
hftuniversity.com··Hacker News

Benchmarking OpenZFS vs EXT4 for my NAS | Heitor's log

 📋Copy-on-Write
heitorpb.github.io·
Less-relevant results

servetarslan02/HookSniff: Reliable webhook delivery for developers. 11 SDKs. Rust API + Next.js dashboard. Send webhooks. We deliver them. Failed? We retry.

 🛡️Fault Tolerance  Content type: Code
github.com··DEV

Monitor SLAs and scale ClickHouse Cloud with clickhousectl and agents

 📊Columnar Execution  Content type: Blog
clickhouse.com·

1-bit and 1.58 bit LLM Benchmarking on Jetson Orin Nano Super | Bonsai LM

 🎮GPU Scheduling
smolhub.com··r/LocalLLaMA

Kafka Share Groups and Parallelizing Consumption - Part 3: Client-local parallelism

 📦Micro Batching  Content type: Blog
jack-vanlightly.com·

Full Observability for Pinecone: Introducing an Open-Source Monitoring Stack for SaaS and BYOC

 🔐Validator Clients  Content type: Blog
pinecone.io·

What Building a Multi-Model AI Gateway Taught Me About Reliability

 📊Profiling
openrain.ai··DEV

We Cut Semgrep's Taint Analysis Time by 75%

 📊Liveness Analysis  Content type: Blog
semgrep.dev··Hacker News

Apache Spark Real-Time Mode for Gaming: A Better Way to Do Real-Time Sessionization

 📦Micro Batching  Content type: Blog
databricks.com·

Building Production Multi-Agent Systems: Real-World Lessons from Genie

 🤝Paxos Consensus

AI-Native Closed-Loop Security for 6G-Enabled Cyber-Physical Systems: From Edge Detection to Network-Wide Mitigation

 🌐FPGA Networking  Content type: Academic
arxiv.org·

History of the Internet: From ARPANET to the Modern Web

 Zig

guycipher/keybench: A scriptable, extensible performance tool for sorted key value stores.

 💾Memtable  Content type: Code
github.com··Hacker News

SPA: A SQL-Plan-Aware Reinforcement Learning Framework for Query Rewriting with LLMs

 ⚙️Adaptive Execution  Content type: Academic
arxiv.org·

What to look for when selecting a real-time analytical database

 🔒Transactions  Content type: Blog
clickhouse.com·

Anycast Performance in Context

 🔀Receive Side Scaling  Content type: Academic
arxiv.org·

JoniMartin27/lookspan: Local-first observability dashboard for AI agents. MCP-native. Look at every span your agents emit.

 📦In-process Databases  Content type: Code
github.com··Hacker News
