Back to all scenarios
Scenario #431
Scaling & Load
Kubernetes v1.24, AWS EKS

Inconsistent Pod Scaling During Burst Traffic

Pod scaling inconsistently triggered during burst traffic spikes, causing service delays.

Find this helpful?
What Happened

A traffic burst caused sporadic scaling events that didn’t meet demand, leading to delayed responses.

Diagnosis Steps
  • 1Observed scaling logs that showed pod scaling lagged behind traffic spikes.
  • 2Metrics confirmed traffic surges weren't matched by scaling.
Root Cause

Insufficient scaling thresholds and long stabilization windows for HPA.

Fix/Workaround
• Adjusted HPA settings to lower the stabilization window and set appropriate scaling thresholds.
Lessons Learned

HPA scaling settings should be tuned to handle burst traffic effectively.

How to Avoid
  • 1Use lower stabilization windows for quicker scaling reactions.
  • 2Monitor scaling efficiency during traffic bursts.