Scenario #8
Cluster Management
K8s v1.23, Azure AKS

API Server High Latency Due to Event Flooding

A custom controller spamming Kubernetes events slowed down the entire API server.

What Happened

A custom controller emitted Kubernetes events at ~50/second, flooding the etcd event store and driving up API server latency.
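For illustration, here is a minimal Go sketch of the anti-pattern, assuming a hypothetical Controller type, recorder field, and SyncFailed reason (none of these are the actual controller's code):

```go
package controller

import (
	"time"

	v1 "k8s.io/api/core/v1"
	"k8s.io/apimachinery/pkg/runtime"
	"k8s.io/client-go/tools/record"
)

// Controller stands in for the misbehaving custom controller.
type Controller struct {
	recorder record.EventRecorder
}

// reconcile emits a Warning event on every failed pass. The timestamp baked
// into the message makes each event unique, which also defeats client-go's
// built-in event aggregation, so nothing collapses the flood.
func (c *Controller) reconcile(obj runtime.Object, err error) {
	if err != nil {
		// In a hot requeue loop (~50 reconciles/second) this line becomes
		// ~50 Event writes/second against the API server and etcd.
		c.recorder.Eventf(obj, v1.EventTypeWarning, "SyncFailed",
			"sync failed at %s: %v", time.Now().Format(time.RFC3339Nano), err)
	}
}
```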

Diagnosis Steps
  1. Prometheus showed a spike in event count.
  2. kubectl get events --sort-by=.metadata.creationTimestamp showed massive spam (a programmatic version of this check is sketched below).
  3. Found a misbehaving controller repeating the same failure events.
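As a complement to the kubectl check, a small client-go program can tally events by emitting component and reason to pinpoint the spammer. This is an illustrative sketch; the kubeconfig loading and ranking logic are assumptions, not part of the original diagnosis:

```go
package main

import (
	"context"
	"fmt"
	"sort"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/tools/clientcmd"
)

func main() {
	// Build a client from the local kubeconfig (~/.kube/config).
	config, err := clientcmd.BuildConfigFromFlags("", clientcmd.RecommendedHomeFile)
	if err != nil {
		panic(err)
	}
	clientset := kubernetes.NewForConfigOrDie(config)

	// List events across all namespaces and tally them by source + reason.
	events, err := clientset.CoreV1().Events("").List(context.TODO(), metav1.ListOptions{})
	if err != nil {
		panic(err)
	}
	counts := map[string]int32{}
	for _, e := range events.Items {
		counts[e.Source.Component+"/"+e.Reason] += e.Count
	}

	// Rank noisiest sources first.
	type tally struct {
		key   string
		count int32
	}
	ranked := make([]tally, 0, len(counts))
	for k, n := range counts {
		ranked = append(ranked, tally{k, n})
	}
	sort.Slice(ranked, func(i, j int) bool { return ranked[i].count > ranked[j].count })
	for _, t := range ranked {
		fmt.Printf("%8d  %s\n", t.count, t.key)
	}
}
```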
Root Cause

No rate limiting on event creation in controller logic.

Fix/Workaround
• Patched the controller to rate-limit record.Eventf calls (see the sketch below).
• Cleaned up the accumulated old events.
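A minimal sketch of such a patch, assuming a token-bucket wrapper around the recorder; the rateLimitedRecorder type and the 1 event/second budget are illustrative choices, not the actual fix:

```go
package controller

import (
	"golang.org/x/time/rate"

	"k8s.io/apimachinery/pkg/runtime"
	"k8s.io/client-go/tools/record"
)

// rateLimitedRecorder drops events once its token bucket is empty. Only
// Eventf is wrapped here; Event and AnnotatedEventf pass through unchanged.
type rateLimitedRecorder struct {
	record.EventRecorder
	limiter *rate.Limiter
}

func (r *rateLimitedRecorder) Eventf(object runtime.Object, eventtype, reason, messageFmt string, args ...interface{}) {
	// Allow is non-blocking: when the budget is exhausted, the event is
	// silently dropped instead of becoming another API write.
	if r.limiter.Allow() {
		r.EventRecorder.Eventf(object, eventtype, reason, messageFmt, args...)
	}
}

// NewRateLimitedRecorder caps emission at ~1 event/second with a burst of 5,
// versus the ~50/second the misbehaving controller was producing.
func NewRateLimitedRecorder(inner record.EventRecorder) record.EventRecorder {
	return &rateLimitedRecorder{
		EventRecorder: inner,
		limiter:       rate.NewLimiter(rate.Limit(1), 5),
	}
}
```

Dropping surplus events is acceptable here because events are best-effort diagnostics, not state the controller depends on.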
Lessons Learned

Events are not free: each one is an API object persisted in etcd, so high event volume degrades both etcd and the API server.

How to Avoid
  1. Use deduplicated or summarized event logic (see the correlator sketch below).
  2. Set the API server's --event-ttl=1h and enable the EventRateLimit admission plugin (configured via --admission-control-config-file).
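For the deduplication side, client-go's event broadcaster already ships a correlator that aggregates similar events and filters spam before anything reaches the API server. A sketch of tightening it; the specific values and the my-controller component name are assumptions:

```go
package controller

import (
	v1 "k8s.io/api/core/v1"
	"k8s.io/client-go/kubernetes/scheme"
	"k8s.io/client-go/tools/record"
)

// NewDedupedRecorder builds a recorder whose broadcaster aggregates similar
// events and rate-limits per-object spam at the source.
func NewDedupedRecorder() (record.EventBroadcaster, record.EventRecorder) {
	broadcaster := record.NewBroadcasterWithCorrelatorOptions(record.CorrelatorOptions{
		QPS:                  1.0 / 60.0, // refill ~1 token/minute per key once the burst is spent
		BurstSize:            10,         // tolerate short bursts before the spam filter engages
		MaxEvents:            5,          // after 5 similar events, aggregate into one summary event
		MaxIntervalInSeconds: 600,
	})
	// Wire the broadcaster to the API server separately, e.g. via
	// broadcaster.StartRecordingToSink(...) once a clientset is available.
	recorder := broadcaster.NewRecorder(scheme.Scheme, v1.EventSource{Component: "my-controller"})
	return broadcaster, recorder
}
```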