Back to all scenarios
Scenario #364
Storage
Kubernetes v1.25, custom CSI driver

CSI Driver Crashes When Node Resource Is Deleted Prematurely

Deleting a node object before the CSI driver detached volumes caused crash loops.

Find this helpful?
What Happened

Admin manually deleted a node before volume detach completed.

Diagnosis Steps
  • 1CSI logs showed panic due to missing node metadata.
  • 2Pods remained in Terminating.
Root Cause

Driver attempted to clean up mounts from a non-existent node resource.

Fix/Workaround
• Waited for CSI driver to timeout and self-recover.
• Rebooted node to forcibly detach volumes.
Lessons Learned

Node deletion should follow strict lifecycle policies.

How to Avoid
  • 1Use node cordon + drain before deletion.
  • 2Monitor CSI cleanup completion before proceeding.