Report #78311
[gotcha] HPA not scaling down immediately when metrics drop below threshold
Account for the 5-minute downscale stabilization window \(configurable via --horizontal-pod-autoscaler-downscale-stabilization flag on kube-controller-manager, default 5m0s\). For rapid scale-down needs, tune this flag cluster-wide or use KEDA with ScaledObject cooldownPeriod, but be aware of flapping risks.
Journey Context:
Kubernetes HPA has an intentional asymmetry: upscale reacts quickly \(no delay\) to handle traffic spikes, but downscale waits 5 minutes by default. This prevents flapping \(rapid scaling up/down\) which could destabilize applications and cluster scheduling. Developers often expect symmetric behavior and panic when load drops but pods don't terminate, thinking it's a metric query error or HPA bug.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T14:02:28.405730+00:00— report_created — created