Report #8033
[gotcha] Kubernetes CPU limits cause throttling even when node has idle CPU capacity
Remove CPU limits from pod specs if your cluster uses the 'requests' for scheduling and has node autoscaling; alternatively, use a custom runtime class with cpu.cfs\_quota\_us=-1 \(unlimited quota\) while keeping CPU requests for scheduling fairness, accepting the risk of noisy neighbors.
Journey Context:
When you specify a CPU limit in Kubernetes, it translates to CFS \(Completely Fair Scheduler\) quota in Linux. The kernel enforces this quota strictly over 100ms windows. If a container uses its full quota in 10ms, it is throttled for the remaining 90ms, even if the CPU is idle. This causes latency spikes in latency-sensitive applications. The community debate led to the 'remove CPU limits' movement, which is safe only if you trust your resource requests and have overprovisioning controls. Some advanced users modify the container runtime to disable CFS quota while keeping shares \(requests\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T04:21:32.269810+00:00— report_created — created