Back to all scenarios
Scenario #93
Cluster Management
K8s v1.22, Azure AKS
Pod Readiness Probe Failure Due to Slow Initialization
Pods failed their readiness probes during initialization, causing traffic to be routed to unhealthy instances.
Find this helpful?
What Happened
The pods had a slow initialization time, but the readiness probe timeout was set too low, causing premature failure.
Diagnosis Steps
- 1Checked pod events and logs, discovering that readiness probes were failing due to long startup times.
- 2Increased the timeout period for the readiness probe and observed that the pods began to pass the probe after startup.
Root Cause
Readiness probe timeout was set too low for the pod's initialization process.
Fix/Workaround
• Increased the readiness probe timeout and delay parameters.
• Re-applied the deployment, and the pods started passing readiness checks.
Lessons Learned
The readiness probe timeout should be configured according to the actual initialization time of the pod.
How to Avoid
- 1Monitor pod initialization times and adjust readiness probe configurations accordingly.
- 2Use a gradual rollout for new deployments to avoid sudden failures.