Back to all scenarios
Scenario #144
Networking
K8s v1.22, DigitalOcean Kubernetes
TCP Retransmissions Due to Network Saturation
TCP retransmissions increased due to network saturation, leading to degraded pod-to-pod communication.
Find this helpful?
What Happened
Pods in the cluster started experiencing increased latency and timeouts, which was traced back to TCP retransmissions caused by network saturation.
Diagnosis Steps
- 1Analyzed network performance using tcpdump and found retransmissions occurring during periods of high traffic.
- 2Verified that there was no hardware failure, but network bandwidth was fully utilized.
Root Cause
Insufficient network bandwidth during high traffic periods.
Fix/Workaround
• Increased network bandwidth allocation for the cluster.
• Implemented QoS policies to prioritize critical traffic.
Lessons Learned
Network saturation can severely affect pod communication, especially under heavy loads.
How to Avoid
- 1Use quality-of-service (QoS) and bandwidth throttling to prevent network saturation.
- 2Regularly monitor network bandwidth and adjust scaling policies to meet traffic demands.