Back to all scenarios
Scenario #144
Networking
K8s v1.22, DigitalOcean Kubernetes

TCP Retransmissions Due to Network Saturation

TCP retransmissions increased due to network saturation, leading to degraded pod-to-pod communication.

Find this helpful?
What Happened

Pods in the cluster started experiencing increased latency and timeouts, which was traced back to TCP retransmissions caused by network saturation.

Diagnosis Steps
  • 1Analyzed network performance using tcpdump and found retransmissions occurring during periods of high traffic.
  • 2Verified that there was no hardware failure, but network bandwidth was fully utilized.
Root Cause

Insufficient network bandwidth during high traffic periods.

Fix/Workaround
• Increased network bandwidth allocation for the cluster.
• Implemented QoS policies to prioritize critical traffic.
Lessons Learned

Network saturation can severely affect pod communication, especially under heavy loads.

How to Avoid
  • 1Use quality-of-service (QoS) and bandwidth throttling to prevent network saturation.
  • 2Regularly monitor network bandwidth and adjust scaling policies to meet traffic demands.