Back to all scenarios
Scenario #475
Scaling & Load
Kubernetes v1.22, Google Cloud

Cluster Autoscaler Not Triggering Under High Load

The Cluster Autoscaler failed to trigger under high load due to misconfiguration in resource requests.

Find this helpful?
What Happened

Despite a high load on the cluster, the Cluster Autoscaler did not trigger additional nodes due to misconfigured resource requests for pods.

Diagnosis Steps
  • 1Reviewed autoscaler logs and resource requests, and discovered that pods were requesting more resources than available on the nodes.
  • 2Resource requests exceeded available node capacity, but the autoscaler did not respond appropriately.
Root Cause

Misconfigured resource requests for pods, leading to poor autoscaler behavior.

Fix/Workaround
• Adjusted resource requests and limits to match node capacity.
• Tuned the Cluster Autoscaler to scale more aggressively during high load situations.
Lessons Learned

Proper resource requests are critical for effective autoscaling.

How to Avoid
  • 1Continuously monitor and adjust resource requests based on actual usage patterns.
  • 2Use autoscaling metrics that consider both resource usage and load.