Back to all scenarios
Scenario #415
Scaling & Load
Kubernetes v1.25, custom metrics + Prometheus Adapter
Prometheus Scraper Bottlenecked Custom HPA Metrics
Delays in Prometheus scraping caused lag in HPA reactions.
Find this helpful?
What Happened
HPA lagged 1–2 minutes behind actual load spike.
Diagnosis Steps
- 1prometheus-adapter logs showed stale data timestamps.
- 2HPA scale-up occurred after delay.
Root Cause
Scrape interval was 60s, making HPA respond too slowly.
Fix/Workaround
• Reduced scrape interval for critical metrics.
Lessons Learned
Scrape intervals affect autoscaler agility.
How to Avoid
- 1Match Prometheus scrape intervals with HPA polling needs.
- 2Use rate() or avg_over_time() to smooth metrics.