Back to all scenarios
Scenario #83
Cluster Management
K8s v1.22, GKE
Horizontal Pod Autoscaler Under-Scaling During Peak Load
HPA failed to scale the pods appropriately during a sudden spike in load.
Find this helpful?
What Happened
The HPA did not scale the pods properly during a traffic surge due to incorrect metric thresholds.
Diagnosis Steps
- 1Checked HPA settings and identified that the CPU utilization threshold was too high.
- 2Verified the metric server was reporting correct metrics.
Root Cause
Incorrect scaling thresholds set in the HPA configuration.
Fix/Workaround
• Adjusted HPA thresholds to scale more aggressively under higher loads.
• Increased the replica count to handle the peak load.
Lessons Learned
HPA thresholds should be fine-tuned based on expected load patterns.
How to Avoid
- 1Regularly review and adjust HPA configurations to reflect actual workload behavior.
- 2Use custom metrics for better scaling control.