Back to all scenarios
Scenario #141
Networking
K8s v1.21, Azure AKS
Network Bandwidth Saturation During Large-Scale Deployments
Network bandwidth was saturated during a large-scale deployment, affecting cluster communication.
Find this helpful?
What Happened
During a large-scale application deployment, network traffic consumed all available bandwidth, leading to service timeouts and network packet loss.
Diagnosis Steps
- 1Monitored network traffic and found that the deployment was causing spikes in bandwidth utilization.
- 2Identified large Docker images being pulled and deployed across nodes.
Root Cause
Network bandwidth saturation caused by the simultaneous pulling of large Docker images.
Fix/Workaround
• Staggered the deployment of pods to distribute the load more evenly.
• Used a local registry to reduce the impact of external image pulls.
Lessons Learned
Ensure that large-scale deployments are distributed in a way that does not overwhelm the network.
How to Avoid
- 1Use image caching and local registries for large deployments.
- 2Implement deployment strategies to stagger or batch workloads.