Report #85828
[gotcha] Container throttled by CPU limit despite low average CPU utilization
Remove CPU limits for latency-sensitive workloads \(retain requests\), or use \`cpu.cfs\_quota\_us\` tuning via kubelet \`--cpu-cfs-quota-period\` flag \(requires v1.25\+ feature gate\) to shorten the throttling window.
Journey Context:
Linux CFS scheduler enforces CPU quotas in 100ms windows \(cfs\_period\_us\). A container with a 100m limit \(0.1 vCPU\) gets 10ms of CPU time per 100ms window. If a Go/Java app has a GC spike or a Rust app has a burst, it uses its 10ms instantly, then is throttled for the remaining 90ms even if its average over 1s is only 10%. This manifests as p99 latency spikes with no CPU usage increase in metrics.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T02:39:07.425389+00:00— report_created — created