Back to all scenarios
Scenario #426
Scaling & Load
Kubernetes v1.22, GCP
Downscale Too Aggressive During Traffic Dips
Autoscaler scaled down too aggressively during short traffic dips, causing pod churn.
Find this helpful?
What Happened
Traffic decreased briefly, triggering a scale-in, only for the traffic to spike again.
Diagnosis Steps
- 1HPA scaled down to 0 replicas during a brief traffic lull.
- 2Pod churn noticed after every scale-in event.
Root Cause
Aggressive scaling behavior set too low a minReplicas threshold.
Fix/Workaround
• Set a minimum of 1 replica for critical workloads.
• Tuned scaling thresholds to avoid premature downscaling.
Lessons Learned
Aggressive scaling policies can cause instability in unpredictable workloads.
How to Avoid
- 1Use minReplicas for essential workloads.
- 2Implement stabilization windows for both scale-up and scale-down.