Back to all scenarios
Scenario #17
Cluster Management
K8s v1.24, Velero-based etcd backup

Inconsistent Cluster State After Partial Backup Restore

A partial etcd restore led to stale object references and broken dependencies.

Find this helpful?
What Happened

etcd snapshot was restored, but PVCs and secrets weren’t included. Many pods failed to mount or pull secrets.

Diagnosis Steps
  • 1Pods failed with “volume not found” and “secret missing”.
  • 2kubectl get pvc --all-namespaces returned empty.
  • 3Compared resource counts pre- and post-restore.
Root Cause

Restore did not include volume snapshots or Kubernetes secrets, leading to an incomplete object graph.

Fix/Workaround
• Manually recreated PVCs and secrets using backups from another tool.
• Redeployed apps.
Lessons Learned

etcd backup is not enough alone.

How to Avoid
  • 1Use backup tools that support volume + etcd (e.g., Velero with restic).
  • 2Periodically test full cluster restores.