The Concurrency Trap: How an Atomic Counter Stalled a Pipeline
redixhumayun.github.ioΒ·29wΒ·
Discuss: Hacker News
Preview
Report Post

Note: I wrote this post for Conviva. You can read the version published on Conviva’s website here

On February 2nd, Conviva’s streaming analytics platform suddenly ground to a crawl but only for one customer. P99 latency spiked without clear reason, pushing our DAG engine to its limits. What started as a puzzling slowdown soon became a deep dive into concurrency pitfalls.

Conviva’s platform is built to handle 5 trillion daily events, powered by a DAG (directed acyclic graph) based analytics engine. Each cu…

Similar Posts

Loading similar posts...