Report #95109
[gotcha] DNS record fix not propagating; users still seeing NXDOMAIN errors long after DNS is corrected
Lower the SOA minimum TTL \(negative caching TTL\) before incidents, or wait out the full duration of the previously cached NXDOMAIN \(often 1 hour or more\).
Journey Context:
When a DNS query returns NXDOMAIN \(domain doesn't exist\), resolvers \(ISP, OS, applications\) cache this \*failure\* according to the SOA record's MINIMUM field \(RFC 2308\), not the positive TTL. If you typo a DNS record, causing an NXDOMAIN, then fix it 5 minutes later, users will still see failures for the duration of the negative cache \(often 3600s/1 hour\). This baffles teams who check \`dig\` at the authoritative server \(which is correct\) but users still can't resolve. The fix is prevention: keep SOA minimum TTL low \(e.g., 300s\) for mutable zones. During an incident, flushing local cache \(systemd-resolved, nscd\) or waiting is the only option.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T18:13:10.732118+00:00— report_created — created