Back to all scenarios
Scenario #431
Scaling & Load
Kubernetes v1.24, AWS EKS
Inconsistent Pod Scaling During Burst Traffic
Pod scaling inconsistently triggered during burst traffic spikes, causing service delays.
Find this helpful?
What Happened
A traffic burst caused sporadic scaling events that didn’t meet demand, leading to delayed responses.
Diagnosis Steps
- 1Observed scaling logs that showed pod scaling lagged behind traffic spikes.
- 2Metrics confirmed traffic surges weren't matched by scaling.
Root Cause
Insufficient scaling thresholds and long stabilization windows for HPA.
Fix/Workaround
• Adjusted HPA settings to lower the stabilization window and set appropriate scaling thresholds.
Lessons Learned
HPA scaling settings should be tuned to handle burst traffic effectively.
How to Avoid
- 1Use lower stabilization windows for quicker scaling reactions.
- 2Monitor scaling efficiency during traffic bursts.