Auto-diagnosing Kubernetes alerts with HolmesGPT and CNCF tools (opens in new tab)
What a two-person SRE team learned building an AI investigation pipeline. Spoiler: the runbooks mattered more than the model. Why we built this At STCLab, our SRE team supports multiple Amazon EKS clusters running high-traffic production...
Read the original article