Report #4944
[gotcha] Kubernetes CPU limits causing unexpected throttling and latency spikes despite low node utilization
Remove CPU limits entirely \(rely only on requests\) unless running multi-tenant untrusted workloads; if limits are mandatory, enable CPU CFS quota bursting via Kubelet feature gate or increase cpu.cfs\_quota\_us period
Journey Context:
The Linux CFS scheduler enforces limits per 100ms quota period. A container with a 200m limit gets 20ms CPU time per 100ms window. Bursts of work exhaust this instantly, causing throttling visible as runc delays, even if the node is idle. Requests provide soft isolation via shares; limits provide hard caps. For single-tenant clusters, limits harm more than help by introducing artificial bottlenecks. Tradeoff: removing limits risks noisy neighbor issues without proper node capacity planning and pod disruption budgets.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T20:20:46.414289+00:00— report_created — created