Back to all scenarios
Scenario #407
Scaling & Load
Kubernetes v1.25, KEDA + Knative
Scale-To-Zero Caused Cold Starts and SLA Violations
Pods scaled to zero, but requests during cold start breached SLA.
Find this helpful?
What Happened
First request after inactivity hit cold-start delay of ~15s.
Diagnosis Steps
- 1Prometheus response latency showed spikes after idle periods.
- 2Knative logs: cold-start events.
Root Cause
Cold starts on scale-from-zero under high latency constraint.
Fix/Workaround
• Added minReplicaCount: 1 to high-SLA services.
Lessons Learned
Scale-to-zero saves cost, but not for latency-sensitive apps.
How to Avoid
- 1Use minReplicaCount and warmers for performance-critical services.