Report #15749
[gotcha] Kubernetes HPA scales too late because targetPercentage references requests not limits
Calculate HPA targetPercentage as a ratio of resource requests, not limits. If request=100m and limit=1000m, a 50% target scales at 50m \(not 500m\). Either align request=limit for HPA workloads \(Guaranteed QoS\) or set targetPercentage = \(desired\_threshold / request\) \* 100.
Journey Context:
HPA calculates utilization as \(current\_usage / resource\_request\) \* 100. Developers commonly assume it scales based on limits, setting a 70% target thinking it means '70% of limit'. With a 100m request and 1000m limit, the container throttles at 1000m, but HPA sees 100% utilization at just 100m \(100% of request\), causing massive over-provisioning or failure to scale during load. The fix aligns scaling thresholds with actual resource reservation guarantees.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T00:53:30.787092+00:00— report_created — created