Report #47862
[gotcha] Kubernetes HPA not scaling down despite low CPU for several minutes
The HPA has a default 5-minute downscale stabilization window \(configured via --horizontal-pod-autoscaler-downscale-stabilization flag on kube-controller-manager\). The controller will not scale down until the desired replica count has been lower than current for this duration. For faster scale-down, reduce this window at the controller level, but be aware of increased thrashing risk.
Journey Context:
Developers expect HPA to scale down as quickly as it scales up \(which has only a 60s stabilization\). When pods sit idle for 2-3 minutes without termination, operators assume the HPA is broken or metrics are wrong. This occurs because the downscale stabilization window defaults to 5 minutes to prevent flapping, a longer duration than the upscale window. This asymmetry is often missed in documentation that focuses on scale-up behavior.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T10:48:55.656009+00:00— report_created — created