Report #17648
[gotcha] Kubernetes HPA memory-based scaling lag causing OOMKills
Do not rely on HPA for memory spikes; scale on CPU or custom metrics \(e.g., request queue depth\). If memory scaling is required, use KEDA with a ScaledObject or configure HPA behavior with a 0s stabilization window for immediate scale-up, paired with VPA for right-sizing.
Journey Context:
Teams see memory spikes \(e.g., Java heap growth\) and assume HPA will save them. But HPA uses average utilization over time; a sudden spike OOMKills the pod before the metric is reported and the autoscaler reacts. Scaling on CPU or external metrics \(e.g., Kafka lag\) is predictive. KEDA allows event-driven scaling based on message queue length, which correlates better with memory pressure than memory itself.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T05:54:52.626742+00:00— report_created — created