Scenario #87
Cluster Management
K8s v1.21, AWS EKS

Pod Network Connectivity Issues After Node Reboot

Pods lost network connectivity after a node reboot, causing communication failures between services.

What Happened

After a node was rebooted, the networking components failed to re-establish proper connectivity for the pods.

Diagnosis Steps
  1. Checked pod logs and found connection timeouts between services.
  2. Investigated the node and found networking components (e.g., CNI plugin) were not properly re-initialized after the reboot.
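
The node-level check in step 2 could be sketched roughly as follows. This is a minimal illustration, not the procedure actually used in the incident. It assumes the cluster runs the Amazon VPC CNI (the EKS default), whose DaemonSet pods live in `kube-system` with the label `k8s-app=aws-node`; the source does not name the CNI plugin, so adjust both values for a different one. The function names are hypothetical.

```python
import json
import subprocess

# Assumption (not stated in the source): Amazon VPC CNI defaults on EKS.
CNI_NAMESPACE = "kube-system"
CNI_LABEL = "k8s-app=aws-node"


def cni_pod_status(pod_list, node_name):
    """Summarize the CNI pods scheduled on node_name from a parsed PodList dict."""
    summary = []
    for pod in pod_list.get("items", []):
        if pod.get("spec", {}).get("nodeName") != node_name:
            continue
        status = pod.get("status", {})
        containers = status.get("containerStatuses", [])
        summary.append({
            "name": pod["metadata"]["name"],
            "phase": status.get("phase", "Unknown"),
            # A pod with no container statuses yet is treated as not ready.
            "ready": bool(containers) and all(c.get("ready", False) for c in containers),
            "restarts": sum(c.get("restartCount", 0) for c in containers),
        })
    return summary


def fetch_cni_pods():
    """Fetch the CNI pods as a dict by shelling out to kubectl (needs cluster access)."""
    out = subprocess.run(
        ["kubectl", "get", "pods", "-n", CNI_NAMESPACE, "-l", CNI_LABEL, "-o", "json"],
        check=True, capture_output=True, text=True,
    ).stdout
    return json.loads(out)
```

Calling `cni_pod_status(fetch_cni_pods(), node_name)` with the rebooted node's name returns one entry per CNI pod on that node. An empty list, or an entry with `ready` set to `False`, would be consistent with the CNI not having come back after the reboot.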
Root Cause

The CNI plugin did not properly re-initialize after the node reboot, causing networking failures.

Fix/Workaround
• Manually restarted the CNI plugin on the affected node.
• Ensured that the CNI plugin was configured to restart properly after a node reboot.
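
The source does not say how the CNI plugin was restarted. One common approach, when the plugin runs as a DaemonSet, is to delete its pod on the affected node so the DaemonSet controller recreates it. The sketch below builds that `kubectl` command. It assumes the Amazon VPC CNI defaults (`kube-system` namespace, `k8s-app=aws-node` label), and the function names are hypothetical.

```python
import subprocess

# Assumption (not stated in the source): Amazon VPC CNI defaults on EKS.
CNI_NAMESPACE = "kube-system"
CNI_LABEL = "k8s-app=aws-node"


def restart_cni_command(node_name, dry_run=False):
    """Build the kubectl argv that deletes the CNI pod(s) on a single node."""
    if not node_name:
        raise ValueError("node_name is required")
    cmd = [
        "kubectl", "delete", "pod",
        "-n", CNI_NAMESPACE,
        "-l", CNI_LABEL,
        # Restrict the delete to pods scheduled on the affected node.
        "--field-selector", f"spec.nodeName={node_name}",
    ]
    if dry_run:
        # Client-side dry run: print what would be deleted without deleting it.
        cmd.append("--dry-run=client")
    return cmd


def restart_cni(node_name, dry_run=False):
    """Run the delete; the DaemonSet controller then recreates the pod."""
    subprocess.run(restart_cni_command(node_name, dry_run), check=True)
```

Running `restart_cni(node_name, dry_run=True)` first is a cheap way to confirm that the selectors match only the pod on the rebooted node before deleting anything.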
Lessons Learned

Ensure that critical components like CNI plugins are resilient to node reboots.

How to Avoid
  1. Configure the CNI plugin to restart automatically after node reboots.
  2. Monitor networking components to ensure they are healthy after reboots.
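
As one possible shape for the monitoring in step 2, the sketch below flags nodes that report Ready but have no Ready CNI pod, which is the state a failed CNI re-initialization would tend to leave behind. It is an illustration only: the inputs are assumed to be the parsed JSON from `kubectl get nodes -o json` and from `kubectl get pods -o json` filtered to the CNI DaemonSet's pods, and the function names are hypothetical.

```python
def _is_ready(obj):
    """True if a Node or Pod object has a Ready condition with status "True"."""
    conditions = obj.get("status", {}).get("conditions", [])
    return any(
        c.get("type") == "Ready" and c.get("status") == "True" for c in conditions
    )


def nodes_missing_ready_cni(node_list, cni_pod_list):
    """Return sorted names of Ready nodes that have no Ready CNI pod."""
    # Nodes that currently host at least one Ready CNI pod.
    covered = {
        pod.get("spec", {}).get("nodeName")
        for pod in cni_pod_list.get("items", [])
        if _is_ready(pod)
    }
    missing = [
        node["metadata"]["name"]
        for node in node_list.get("items", [])
        if _is_ready(node) and node["metadata"]["name"] not in covered
    ]
    return sorted(missing)
```

A non-empty result could feed an alert or a scheduled job. Nodes that are not Ready are skipped here on the assumption that existing node-health alerts already cover them.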