Report #3125
[bug\_fix] OOMKilled
Raise the container's memory limit \(\`resources.limits.memory\`\) to a value supported by memory profiling, or reduce the application's memory footprint. If the workload is variable, enable a Vertical Pod Autoscaler \(VPA\) in recommendation or auto mode. For Java apps, ensure container-aware heap sizing \(\`-XX:\+UseContainerSupport\`\) is enabled so the JVM respects cgroup limits.
Journey Context:
A service runs fine under light load but pods restart every few hours. \`kubectl describe pod\` shows Last State: Terminated with Reason: OOMKilled and Exit Code: 137. \`kubectl top pod\` shows memory pegged at the limit. You open metrics in Prometheus and see a spike during batch processing. You profile the app locally and discover a data-frame holding the entire result set. You increase \`resources.limits.memory\` from 512Mi to 2Gi and add a matching request. For a Java service, you also set \`-XX:MaxRAMPercentage=75.0\` so the JVM heap scales with the container limit rather than the host's RAM. Restarts stop.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T15:32:45.712118+00:00— report_created — created