Back to all scenarios
Scenario #319
Storage
Kubernetes v1.24, Velero + CSI Snapshots
Volume Snapshot Restore Includes Corrupt Data
Snapshot-based restore brought back corrupted state due to hot snapshot timing.
Find this helpful?
What Happened
Velero snapshot was taken during active write burst. Filesystem was inconsistent at time of snapshot.
Diagnosis Steps
- 1App logs showed corrupted files after restore.
- 2Snapshot logs showed no quiescing.
- 3Restore replayed same state.
Root Cause
No pre-freeze or app-level quiescing before snapshot.
Fix/Workaround
• Paused writes before snapshot.
• Enabled filesystem freeze hook in Velero plugin.
Lessons Learned
Snapshots must be coordinated with app state.
How to Avoid
- 1Use pre/post hooks for consistent snapshotting.