Back to all scenarios
Scenario #102
Networking
K8s v1.21, Azure AKS
DNS Resolution Failure Due to CoreDNS Pod Crash
DNS resolution failed across the cluster after CoreDNS pods crashed unexpectedly.
Find this helpful?
What Happened
CoreDNS pods were crashed due to resource exhaustion, leading to DNS resolution failure for all services.
Diagnosis Steps
- 1Used kubectl get pods -n kube-system to check the status of CoreDNS pods.
- 2Found that CoreDNS pods were in a crash loop due to memory resource limits being set too low.
Root Cause
CoreDNS resource limits were too restrictive, causing it to run out of memory.
Fix/Workaround
• Increased memory limits for CoreDNS pods.
• Restarted the CoreDNS pods and verified DNS resolution functionality.
Lessons Learned
Ensure CoreDNS has sufficient resources to handle DNS queries for large clusters.
How to Avoid
- 1Regularly monitor CoreDNS metrics for memory and CPU usage.
- 2Adjust resource limits based on cluster size and traffic patterns.