Kubernetes Outage Postmortem: Nodes Stuck in NotReady Due to CNI Failure
Recently, we encountered a critical production outage in our Kubernetes cluster. New nodes provisioned during autoscaling remained in a NotReady state, leading to service disruptions and failed health checks across workloads. In this post, I’ll walk ...
May 28, 20253 min read89


