Report #42025
[gotcha] Kubernetes CPU limit causes latency spikes despite idle CPU
Remove CPU limits for latency-sensitive pods (use requests only), or raise the CFS quota period above its 100ms default with the kubelet flag --cpu-cfs-quota-period (requires kernel 4.6+ and the CustomCPUCFSQuotaPeriod feature gate)
Journey Context:
Linux CFS enforces CPU quotas in 100ms periods by default. A 100m CPU limit gives the container only 10ms of runtime per 100ms period. If a thread needs 15ms of continuous work, it runs for 10ms, is throttled for the remaining 90ms of the period, and finishes the last 5ms in the next period, so 15ms of work takes about 105ms of wall-clock time even if the node's CPU is otherwise idle. This shows up as unexplained P99 latency spikes. Using requests without limits lets the pod burst into idle CPU. Raising the quota period is a trade-off: a longer period allows longer uninterrupted bursts, but lengthens the worst-case stall once the quota is spent. The problem is commonly missed because monitoring shows 'CPU usage 10%', while the throttling metric (container_cpu_cfs_throttled_seconds_total) reveals the bottleneck.
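The arithmetic above can be checked with a small model. This is a minimal sketch, not the kernel's scheduler: it assumes a single CPU-bound thread that starts exactly at a period boundary, no other threads in the container consuming quota, and a quota of period * millicores / 1000. The function name wall_clock_us is hypothetical and exists only for this illustration.

```python
def wall_clock_us(work_us: int, limit_millicores: int, period_us: int = 100_000) -> int:
    """Estimate wall-clock time (microseconds) for one CPU-bound burst under a CFS quota.

    Simplified model (assumptions, not kernel behaviour in full):
    - one thread, starting at the beginning of a quota period
    - nothing else in the cgroup consumes quota
    - quota per period = period * millicores / 1000
    """
    if work_us <= 0:
        return 0
    quota_us = period_us * limit_millicores // 1000
    if quota_us <= 0:
        raise ValueError("limit too small to yield any runtime in this model")

    # Periods in which the whole quota is consumed, plus leftover work.
    full_periods, remainder_us = divmod(work_us, quota_us)
    if remainder_us == 0:
        # Work ends exactly when the quota of the last period runs out,
        # so there is no trailing throttle wait.
        return (full_periods - 1) * period_us + quota_us
    # Each exhausted period costs a full period of wall-clock time
    # (run, then wait throttled); the remainder runs in the next one.
    return full_periods * period_us + remainder_us


if __name__ == "__main__":
    # 15ms of work, 100m limit, default 100ms period
    print(wall_clock_us(15_000, 100))
    # Same work with a 1s period: the quota is 100ms, so the burst fits
    print(wall_clock_us(15_000, 100, period_us=1_000_000))
```

In this model the 15ms burst takes 105ms under the default period and 15ms under a 1s period, which is why a longer period helps short bursts. The model ignores the other side of the trade-off: once a 1s period's quota is spent, the container can be stalled for up to 900ms.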
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T01:00:39.439540+00:00— report_created — created