Back to all scenarios
Scenario #5
Cluster Management
K8s v1.26, Multi-tenant cluster

Misconfigured Taints Blocking Pod Scheduling

Critical workloads weren’t getting scheduled due to incorrect node taints.

Find this helpful?
What Happened

A user added taints (NoSchedule) to all nodes to isolate their app, but forgot to include tolerations in workloads. Other apps stopped working.

Diagnosis Steps
  • 1Pods stuck in Pending state.
  • 2Used kubectl describe pod <pod> – reason: no nodes match tolerations.
  • 3Inspected node taints via kubectl describe node.
Root Cause

Lack of required tolerations on most workloads.

Fix/Workaround
• Removed the inappropriate taints.
• Re-scheduled workloads.
Lessons Learned

Node taints must be reviewed cluster-wide.

How to Avoid
  • 1Educate teams on node taints and tolerations.
  • 2Restrict RBAC for node mutation.