Back to all scenarios
Scenario #350
Storage
Kubernetes v1.26, CSI, NVMes
Node Crash Triggers Volume Remount Loop
After a node crash, volume remount loop occurred due to conflicting device paths.
Find this helpful?
What Happened
Volume had a static device path cached in CSI driver. Upon node recovery, OS assigned a new device path. CSI couldn't reconcile.
Diagnosis Steps
- 1CSI logs: device path not found.
- 2Pod remained in ContainerCreating.
- 3OS showed volume present under different path.
Root Cause
CSI assumed static device path, OS changed it post-reboot.
Fix/Workaround
• Added udev rules for consistent device naming.
• Restarted CSI daemon to detect new device path.
Lessons Learned
Relying on device paths can break persistence.
How to Avoid
- 1Use device UUIDs or filesystem labels where supported.
- 2Restart CSI pods post-reboot events.