Back to all scenarios
Scenario #107
Networking
K8s v1.23, AWS EKS

Network Bottleneck Due to Single Node in NodePool

A network bottleneck occurred due to excessive traffic being handled by a single node in the node pool.

Find this helpful?
What Happened

One node in the node pool was handling all the traffic for multiple pods, leading to CPU and network saturation.

Diagnosis Steps
  • 1Checked node utilization with kubectl top node and identified a single node with high CPU and network load.
  • 2Verified the load distribution across the node pool and found uneven traffic handling.
Root Cause

The cluster autoscaler did not scale the node pool correctly due to resource limits on the instance type.

Fix/Workaround
• Increased the size of the node pool and added more nodes with higher resource capacity.
• Rebalanced the pods across nodes and monitored for stability.
Lessons Learned

Autoscaler configuration and node resource distribution are critical for handling high traffic.

How to Avoid
  • 1Ensure that the cluster autoscaler is correctly configured to balance resource load across all nodes.
  • 2Monitor traffic patterns and node utilization regularly.