CSI Controller Pod Crash Due to Log Overflow

The CSI controller crashed repeatedly due to unbounded logging filling up ephemeral storage.

Find this helpful?

What Happened

A looped RPC error generated thousands of log lines per second. Node /var/log/containers hit 100% disk usage.

Diagnosis Steps

Root Cause

Verbose logging + missing log throttling + small disk.

Fix/Workaround

• Added log rate limits via CSI plugin config.
• Increased node ephemeral storage.

Lessons Learned

Logging misconfigurations can become outages.

How to Avoid