Scenario #44
Cluster Management
K8s v1.20, bare metal
Kubelet Crash Due to Out of Memory (OOM) Errors
Kubelet crashed after running out of memory due to excessive pod resource usage.
What Happened
The kubelet on a node crashed after pods consumed more memory than they had been allocated and exhausted the node's available memory.
Diagnosis Steps
1. Checked kubelet logs for OOM errors.
2. Used kubectl describe node to check resource utilization.
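As a rough illustration of the first step, the sketch below filters log lines for out-of-memory indicators. The function name find_oom_lines, the match patterns, and the sample lines are all assumptions made for this example; real kubelet and kernel log output will look different and may need other patterns.

```python
import re

# Case-insensitive patterns that commonly accompany memory exhaustion.
# These are illustrative assumptions, not an exhaustive or official list.
OOM_PATTERN = re.compile(r"out of memory|oom[-_ ]?kill|\boom\b", re.IGNORECASE)


def find_oom_lines(lines):
    """Return the log lines that mention an out-of-memory condition."""
    return [line.rstrip("\n") for line in lines if OOM_PATTERN.search(line)]


# Made-up sample lines for illustration only; real log output will differ.
sample_log = [
    "node heartbeat ok",
    "Out of memory: Killed process 4242 (example-app)",
    "volume mounted for pod example-pod",
    "container example-app was OOMKilled",
]

oom_hits = find_oom_lines(sample_log)
for hit in oom_hits:
    print(hit)
```

In practice the input would be the kubelet log for the affected node, read from wherever that node keeps it, rather than a hard-coded list.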
Root Cause
Pod resource requests and limits were not set properly, so nothing stopped containers from consuming memory until the node was exhausted.
Fix/Workaround
• Set proper resource requests and limits on pods to prevent memory over-consumption.
• Restarted the kubelet on the affected node.
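One way to find pods that still need attention is to check each container for a memory request and a memory limit. The sketch below does this for a pod manifest represented as a Python dict. The field layout (spec.containers[].resources.requests and .limits) follows the Kubernetes pod spec, but the helper name, the pod, the image names, and the sizes are hypothetical placeholders, not values from this incident.

```python
def containers_missing_memory_settings(pod):
    """Return names of containers lacking a memory request or memory limit."""
    missing = []
    for container in pod.get("spec", {}).get("containers", []):
        resources = container.get("resources") or {}
        has_request = "memory" in (resources.get("requests") or {})
        has_limit = "memory" in (resources.get("limits") or {})
        if not (has_request and has_limit):
            missing.append(container["name"])
    return missing


# Hypothetical pod manifest as a dict; names and sizes are placeholders.
example_pod = {
    "apiVersion": "v1",
    "kind": "Pod",
    "metadata": {"name": "example-pod"},
    "spec": {
        "containers": [
            {
                "name": "bounded",
                "image": "example/app:1.0",
                "resources": {
                    "requests": {"memory": "256Mi", "cpu": "250m"},
                    "limits": {"memory": "512Mi", "cpu": "500m"},
                },
            },
            # No resources block: this container can grow without a cap.
            {"name": "unbounded", "image": "example/sidecar:1.0"},
        ]
    },
}

print(containers_missing_memory_settings(example_pod))  # prints ['unbounded']
```

Appropriate request and limit values depend on the workload; the 256Mi and 512Mi figures here only show the shape of the setting.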
Lessons Learned
Requests and limits on every pod are essential: they keep any single workload from starving the node, and the kubelet running on it, of memory.
How to Avoid
1. Set reasonable resource requests and limits for all pods.
2. Monitor node resource usage to catch resource overuse before it causes crashes.
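A monitoring check along these lines can be sketched as a comparison of memory in use against what the node can allocate. The helpers parse_memory and memory_pressure, the 80% threshold, and the example figures are assumptions for illustration. The parser handles only a subset of Kubernetes quantity suffixes, and the usage figure is assumed to come from whatever monitoring is already in place.

```python
from decimal import Decimal

# Multipliers for a subset of Kubernetes memory suffixes (binary and decimal).
_SUFFIXES = {
    "Ki": 1024, "Mi": 1024**2, "Gi": 1024**3, "Ti": 1024**4,
    "k": 1000, "M": 1000**2, "G": 1000**3, "T": 1000**4,
}


def parse_memory(quantity):
    """Convert a quantity such as '512Mi' or '2G' to a byte count."""
    # Try two-letter suffixes first so 'Mi' is not mistaken for 'M'.
    for suffix in sorted(_SUFFIXES, key=len, reverse=True):
        if quantity.endswith(suffix):
            return int(Decimal(quantity[: -len(suffix)]) * _SUFFIXES[suffix])
    return int(Decimal(quantity))  # no suffix: already a byte count


def memory_pressure(used, allocatable, threshold=0.8):
    """Return (ratio, alert); alert is True when ratio reaches the threshold."""
    ratio = parse_memory(used) / parse_memory(allocatable)
    return ratio, ratio >= threshold


# Hypothetical figures: 13Gi in use on a node with 16Gi allocatable.
ratio, alert = memory_pressure("13Gi", "16Gi")
print(f"memory in use: {ratio:.1%}, alert: {alert}")
```

The threshold is a tuning choice: a lower value warns earlier at the cost of more noise.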