Back to all scenarios
Scenario #196
Networking
K8s v1.22, AWS EKS

High CPU Usage in Nodes Due to Overloaded Network Plugin

Nodes experienced high CPU usage due to an overloaded network plugin that couldn’t handle traffic spikes effectively.

Find this helpful?
What Happened

The network plugin was designed to handle a certain volume of traffic, but when the pod-to-pod communication increased, the plugin was unable to scale efficiently, leading to high CPU consumption.

Diagnosis Steps
  • 1Monitored node metrics with kubectl top nodes and noticed unusually high CPU usage on affected nodes.
  • 2Checked logs for the network plugin and found evidence of resource exhaustion under high traffic conditions.
Root Cause

The network plugin was not adequately resourced to handle high traffic spikes, leading to resource exhaustion.

Fix/Workaround
• Increased resource allocation (CPU/memory) for the network plugin.
• Configured scaling policies for the network plugin to dynamically adjust resources.
Lessons Learned

Network plugins need to be able to scale in response to increased traffic to prevent performance degradation.

How to Avoid
  • 1Regularly monitor network plugin performance and resources.
  • 2Configure auto-scaling and adjust resource allocation based on traffic patterns.