Back to all scenarios
Scenario #44
Cluster Management
K8s v1.20, bare metal

Kubelet Crash Due to Out of Memory (OOM) Errors

Kubelet crashed after running out of memory due to excessive pod resource usage.

Find this helpful?
What Happened

The kubelet on a node crashed after the available memory was exhausted due to pods consuming more memory than allocated.

Diagnosis Steps
  • 1Checked kubelet logs for OOM errors.
  • 2Used kubectl describe node to check resource utilization.
Root Cause

Pod resource requests and limits were not set properly, leading to excessive memory consumption.

Fix/Workaround
• Set proper resource requests and limits on pods to prevent memory over-consumption.
• Restarted the kubelet on the affected node.
Lessons Learned

Pod resource limits and requests are essential for proper node resource utilization.

How to Avoid
  • 1Set reasonable resource requests and limits for all pods.
  • 2Monitor node resource usage to catch resource overuse before it causes crashes.