Back to all scenarios
Scenario #434
Scaling & Load
Kubernetes v1.24, AWS EKS
Service Disruption During Auto-Scaling of StatefulSet
StatefulSet failed to scale properly during maintenance, causing service disruption.
Find this helpful?
What Happened
StatefulSet pods failed to scale correctly during a rolling update due to scaling policies not considering pod states.
Diagnosis Steps
- 1Logs revealed pods were stuck in a Pending state during scale-up.
- 2StatefulSet's rollingUpdate strategy wasn’t optimal.
Root Cause
StatefulSet scaling wasn’t fully compatible with the default rolling update strategy.
Fix/Workaround
• Tuning the rollingUpdate strategy allowed pods to scale without downtime.
Lessons Learned
StatefulSets require special handling during scale-up or down.
How to Avoid
- 1Test scaling strategies with StatefulSets to avoid disruption.
- 2Use strategies suited for the application type.