Scenario #490
Scaling & Load
Kubernetes v1.24, AWS EKS

Cluster Autoscaler Preventing Scaling Due to Underutilized Nodes

The Cluster Autoscaler did not scale the cluster when needed because nodes with low utilization were left out of its scaling decisions.

What Happened

The Cluster Autoscaler blocked scaling because it did not take nodes with low utilization into account, even though those nodes had capacity to host additional pods.

Diagnosis Steps
  1. Reviewed the Cluster Autoscaler logs and found that it was incorrectly marking low-usage nodes as “underutilized” and, as a result, not scaling the cluster.
  2. Observed that other parts of the cluster were under significant load but could not scale because no resources were available.
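
To cross-check what the logs report, per-node utilization can be recomputed by hand. The following is a minimal sketch, assuming utilization is the sum of pod requests divided by the node's allocatable capacity, taking the higher of the CPU and memory ratios, and assuming the 0.5 threshold that `--scale-down-utilization-threshold` uses by default. The `NodeUsage` type and the sample figures are hypothetical and only illustrate the arithmetic.

```python
from dataclasses import dataclass


@dataclass
class NodeUsage:
    """Hypothetical snapshot of one node's requests and allocatable capacity."""
    name: str
    cpu_requested_m: int     # sum of pod CPU requests, in millicores
    cpu_allocatable_m: int   # allocatable CPU, in millicores
    mem_requested_mi: int    # sum of pod memory requests, in MiB
    mem_allocatable_mi: int  # allocatable memory, in MiB


def utilization(node: NodeUsage) -> float:
    """Return the higher of the CPU and memory request ratios."""
    cpu = node.cpu_requested_m / node.cpu_allocatable_m
    mem = node.mem_requested_mi / node.mem_allocatable_mi
    return max(cpu, mem)


def low_utilization_nodes(nodes, threshold=0.5):
    """Return names of nodes whose utilization is below the threshold."""
    return [n.name for n in nodes if utilization(n) < threshold]


# Illustrative data only; real values would come from the cluster.
nodes = [
    NodeUsage("node-a", 500, 2000, 1024, 8192),
    NodeUsage("node-b", 1500, 2000, 1024, 8192),
]
print(low_utilization_nodes(nodes))
```

Comparing this list with the nodes named in the autoscaler logs shows whether the autoscaler's view of utilization matches the actual requests on each node.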
Root Cause

The Cluster Autoscaler was not considering nodes with low resource utilization when making scaling decisions.

Fix/Workaround
• Reconfigured the Cluster Autoscaler so that node utilization is taken into account more dynamically.
• Enabled aggressive scaling policies to allow underutilized nodes to host additional workloads.
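
The scenario does not record which settings were changed. As a rough sketch only, the snippet below renders a few Cluster Autoscaler flags that commonly influence how utilization affects scaling into container arguments. The flag names are real Cluster Autoscaler options, but every value shown is an assumption chosen for illustration, and `render_args` is a hypothetical helper.

```python
def render_args(settings: dict) -> list:
    """Turn a settings mapping into '--flag=value' container arguments."""
    args = []
    for flag, value in sorted(settings.items()):
        if isinstance(value, bool):
            value = str(value).lower()  # flags expect true/false, not True/False
        args.append(f"--{flag}={value}")
    return args


# Illustrative values only; tune them against your own workloads.
EXAMPLE_SETTINGS = {
    "scale-down-utilization-threshold": 0.6,  # utilization below this marks a node unneeded
    "scale-down-unneeded-time": "5m",         # how long a node stays unneeded before removal
    "balance-similar-node-groups": True,      # keep similar node groups at similar sizes
    "expander": "least-waste",                # prefer the node group that wastes the least capacity
}

for arg in render_args(EXAMPLE_SETTINGS):
    print(arg)
```

The rendered arguments would go into the `command` or `args` of the Cluster Autoscaler container; changing one setting at a time makes it easier to attribute any change in scaling behavior.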
Lessons Learned

Cluster Autoscaler configuration should be fine-tuned to better handle all types of node utilization scenarios.

How to Avoid
  1. Regularly review Cluster Autoscaler settings and ensure they are optimized for dynamic scaling.
  2. Implement monitoring and alerting to detect autoscaling anomalies early.
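
One simple signal for an autoscaling anomaly is a pod that stays Pending for longer than a scale-up would normally take. The check below is a minimal sketch of that idea. The pod dictionaries, their field names, and the 10-minute limit are assumptions for illustration; in practice the data would come from the Kubernetes API or a metrics system.

```python
from datetime import datetime, timedelta


def stuck_pending_pods(pods, now, max_pending=timedelta(minutes=10)):
    """Return names of pods that have been Pending longer than max_pending."""
    return [
        p["name"]
        for p in pods
        if p["phase"] == "Pending" and now - p["created"] > max_pending
    ]


# Hypothetical snapshot of pod state at 12:00.
now = datetime(2024, 1, 1, 12, 0)
pods = [
    {"name": "api-1", "phase": "Pending", "created": datetime(2024, 1, 1, 11, 30)},
    {"name": "api-2", "phase": "Pending", "created": datetime(2024, 1, 1, 11, 55)},
    {"name": "db-0", "phase": "Running", "created": datetime(2024, 1, 1, 10, 0)},
]
print(stuck_pending_pods(pods, now))
```

An alert that fires when this list is non-empty would have surfaced the situation in this scenario, where workloads were under load but the cluster was not scaling.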