Back to all scenarios
Scenario #442
Scaling & Load
Kubernetes v1.23, Google Cloud

Unexpected Pod Termination Due to Scaling Policy

Pods were unexpectedly terminated during scale-down due to aggressive scaling policies.

Find this helpful?
What Happened

Scaling policy was too aggressive, and pods were removed even though they were still handling active traffic.

Diagnosis Steps
  • 1Reviewed scaling policy logs and found that the scaleDown strategy was too aggressive.
  • 2Metrics indicated that pods were removed before traffic spikes subsided.
Root Cause

Aggressive scale-down policies without sufficient cool-down periods.

Fix/Workaround
• Adjusted the scaleDown stabilization window and added buffer periods before termination.
• Revisited scaling policy settings to ensure more balanced scaling.
Lessons Learned

Scaling down should be done with more careful consideration, allowing for cool-down periods.

How to Avoid
  • 1Implement soft termination strategies to avoid premature pod removal.
  • 2Adjust the cool-down period in scale-down policies.