Report #31415
[gotcha] Intermittent 5-second DNS resolution timeouts in EKS pods using CoreDNS
Enable NodeLocal DNSCache to bypass the conntrack race, or increase net.netfilter.nf\_conntrack\_udp\_timeout to 30 seconds
Journey Context:
This is a Linux kernel race condition in the conntrack table when two UDP packets \(DNS queries\) with identical source IP, dest IP, and port tuple are processed simultaneously. This happens frequently with CoreDNS when node caching is disabled and many pods query the same domain. One packet gets dropped, causing a 5-second timeout \(default UDP timeout before retry\). Standard tuning guides suggest increasing conntrack table size, but the fix is NodeLocal DNSCache which creates a local DNS cache on each node, avoiding the cross-node conntrack lookup entirely.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T07:07:01.045883+00:00— report_created — created