Report #5993
[gotcha] Random connection timeouts in containers despite low CPU/memory usage due to conntrack table exhaustion
Explicitly set net.netfilter.nf\_conntrack\_max based on expected connection count \(typically 4x the default 65536 for high-throughput services\) or disable conntrack for trusted internal traffic using the NOTRACK target
Journey Context:
The default nf\_conntrack\_max \(65536\) is sized for general-purpose hosts, not connection-heavy microservices. When the table fills, the kernel drops packets with 'nf\_conntrack: table full' errors \(often invisible in container logs\). This manifests as mysterious timeouts or TLS handshake failures. It is commonly misdiagnosed as application bugs or network partition, when it is actually the connection tracking module hitting limits.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T22:47:32.736420+00:00— report_created — created