Report #5997
[gotcha] Application latency spikes despite CPU utilization showing only 50-70% of limit due to CFS quota throttling
Remove CPU limits entirely \(relying on requests for isolation\) or enable CPU CFS quota bursting by setting cpu.cfs\_burst\_us \(requires kernel 5.4\+ and cgroup v2\) to allow short-term usage spikes without throttling
Journey Context:
The Linux CFS \(Completely Fair Scheduler\) enforces CPU limits using 100ms quota periods by default. A container with a 1 CPU limit gets 100ms of CPU time per 100ms period. Bursty workloads \(e.g., processing a request spike\) that need 20ms of work every 50ms will exhaust their quota early in each period and be throttled for the remainder, causing latency. Standard metrics average CPU over 30-60s, hiding these micro-spikes. This is often misdiagnosed as garbage collection or application inefficiency.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T22:47:38.136711+00:00— report_created — created