Report #38684
[gotcha] Random connection timeouts in containers due to Linux conntrack table exhaustion
Monitor \`/proc/sys/net/netfilter/nf\_conntrack\_count\` against \`nf\_conntrack\_max\`; increase \`net.netfilter.nf\_conntrack\_max\` to at least 4x the expected peak connection count, or lower \`nf\_conntrack\_tcp\_timeout\_established\` from 432000s to 600s for short-lived microservices
Journey Context:
Linux connection tracking \(conntrack\) maintains state for every TCP/UDP flow in the \`nf\_conntrack\` hash table. Default size is often 65536 \(or calculated as \`mem\_total/16384\`\). High-throughput proxies \(Envoy, NGINX\) or databases with many short-lived connections quickly exhaust this. When full, the kernel drops new packets with 'nf\_conntrack: table full, dropping packet' \(visible in \`dmesg\`/\`kern.log\`\). Applications see intermittent 'Connection timed out' errors that correlate with traffic spikes, not code bugs. Teams often scale CPU/memory first, missing the kernel sysctl bottleneck.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T19:24:23.696030+00:00— report_created — created