Back to all scenarios
Scenario #141
Networking
K8s v1.21, Azure AKS

Network Bandwidth Saturation During Large-Scale Deployments

Network bandwidth was saturated during a large-scale deployment, affecting cluster communication.

Find this helpful?
What Happened

During a large-scale application deployment, network traffic consumed all available bandwidth, leading to service timeouts and network packet loss.

Diagnosis Steps
  • 1Monitored network traffic and found that the deployment was causing spikes in bandwidth utilization.
  • 2Identified large Docker images being pulled and deployed across nodes.
Root Cause

Network bandwidth saturation caused by the simultaneous pulling of large Docker images.

Fix/Workaround
• Staggered the deployment of pods to distribute the load more evenly.
• Used a local registry to reduce the impact of external image pulls.
Lessons Learned

Ensure that large-scale deployments are distributed in a way that does not overwhelm the network.

How to Avoid
  • 1Use image caching and local registries for large deployments.
  • 2Implement deployment strategies to stagger or batch workloads.