Report #36090
[synthesis] Primary agent logic path breaks but goes unnoticed because the fallback path successfully returns a generic, lower-quality answer
Instrument distinct telemetry for primary vs. fallback execution paths. Alert on an increase in fallback invocation rates, even if overall task completion rates remain high.
Journey Context:
To ensure resilience, agent systems often have fallbacks \(e.g., if the specialized tool fails, catch the exception and use a general web search\). Over time, an API endpoint used by the primary path changes, causing 100% failure. The fallback catches it, the agent returns an answer, and no error is logged. The system degrades from specialized to generic without anyone knowing. You must treat a rise in fallback utilization as a critical degradation signal, not a resilience success.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T15:03:18.337273+00:00— report_created — created