Scenario #33
Cluster Management
K8s v1.23, OpenShift
API Server Slowdowns from High Watch Connection Count
API latency rose sharply due to thousands of watch connections from misbehaving clients.
What Happened
Multiple pods opened persistent watch connections and never closed them, overloading the API server.
Diagnosis Steps
1. Monitored the API server's /metrics endpoint for the apiserver_registered_watchers gauge (see the sketch after this list).
2. Identified the top offenders by connection source IP.
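The watcher gauge can be read straight from the API server's /metrics endpoint. A minimal sketch, assuming client-go and a local kubeconfig; the kubeconfig path and the string filtering are illustrative:

```go
// Dump apiserver_registered_watchers lines from the API server's /metrics
// endpoint to see which resource kinds carry the most watchers.
package main

import (
	"context"
	"fmt"
	"strings"

	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/tools/clientcmd"
)

func main() {
	// Load the local kubeconfig (adjust for your environment).
	config, err := clientcmd.BuildConfigFromFlags("", clientcmd.RecommendedHomeFile)
	if err != nil {
		panic(err)
	}
	clientset, err := kubernetes.NewForConfig(config)
	if err != nil {
		panic(err)
	}

	// GET /metrics and keep only the registered-watchers gauge lines.
	raw, err := clientset.CoreV1().RESTClient().
		Get().AbsPath("/metrics").DoRaw(context.TODO())
	if err != nil {
		panic(err)
	}
	for _, line := range strings.Split(string(raw), "\n") {
		if strings.HasPrefix(line, "apiserver_registered_watchers") {
			fmt.Println(line)
		}
	}
}
```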
Root Cause
A custom controller with flawed watch logic kept opening new watch connections and never closed the old ones.
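A hypothetical reconstruction of that logic, assuming a client-go based controller (the function name and structure are assumptions, not the actual code):

```go
// Hypothetical leak pattern: a new raw watch is opened on every pass, only
// partially consumed, and never stopped, so each iteration leaves another
// long-lived connection open on the API server.
package controller

import (
	"context"
	"time"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
)

func leakyLoop(ctx context.Context, clientset kubernetes.Interface) {
	for {
		w, err := clientset.CoreV1().Pods(metav1.NamespaceAll).
			Watch(ctx, metav1.ListOptions{})
		if err != nil {
			time.Sleep(time.Second)
			continue
		}
		// Reads a single event and moves on. w.Stop() is never called and the
		// channel is never drained, so the connection stays open server-side.
		<-w.ResultChan()
	}
}
```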
Fix/Workaround
• Restarted offending pods.
• Updated the controller to reuse watches (see the sketch below).
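One way to implement the watch reuse, assuming the controller is built on client-go, is to move it onto a shared informer so a single watch per resource type serves every handler; the function and handler names below are illustrative:

```go
// Shared-informer sketch: the factory maintains one watch per resource type
// and fans events out to all registered handlers, instead of each caller
// opening its own watch connection.
package controller

import (
	"time"

	"k8s.io/client-go/informers"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/tools/cache"
)

func runWithSharedInformer(clientset kubernetes.Interface, stopCh <-chan struct{}) {
	// One factory (and therefore one watch per resource) shared by every handler.
	factory := informers.NewSharedInformerFactory(clientset, 30*time.Minute)

	podInformer := factory.Core().V1().Pods().Informer()
	podInformer.AddEventHandler(cache.ResourceEventHandlerFuncs{
		AddFunc:    func(obj interface{}) { /* handle add */ },
		UpdateFunc: func(oldObj, newObj interface{}) { /* handle update */ },
		DeleteFunc: func(obj interface{}) { /* handle delete */ },
	})

	// Start all informers and wait for their caches to sync; the underlying
	// watch is closed cleanly when stopCh is closed.
	factory.Start(stopCh)
	cache.WaitForCacheSync(stopCh, podInformer.HasSynced)
}
```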
Lessons Learned
Unbounded watch connections can exhaust API server resources.
How to Avoid
1. Use client-go with resync periods and connection limits (see the sketch after this list).
2. Enable metrics to detect watch leaks early.
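A rough sketch of item 1, assuming client-go: cap the client's request rate and give shared informers a resync period so one misbehaving controller cannot flood the API server. The QPS, burst, and resync values are illustrative, not recommendations.

```go
// Client-side guardrails: request-rate limits plus a resync period on a
// shared informer factory.
package controller

import (
	"time"

	"k8s.io/client-go/informers"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/rest"
)

func newBoundedFactory() (informers.SharedInformerFactory, error) {
	// In-cluster config; swap for clientcmd loading outside the cluster.
	config, err := rest.InClusterConfig()
	if err != nil {
		return nil, err
	}

	// Client-side throttling: limit sustained and burst request rates.
	config.QPS = 20
	config.Burst = 40

	clientset, err := kubernetes.NewForConfig(config)
	if err != nil {
		return nil, err
	}

	// The resync period periodically replays cached state to event handlers,
	// so controllers reconcile drift without opening extra watches.
	return informers.NewSharedInformerFactory(clientset, 10*time.Minute), nil
}
```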