Report #41382
[synthesis] Agent success rate remains high but task completion quality degrades over time
Track the ratio of primary path execution vs. fallback path execution. Alert on increases in fallback invocation rather than just error rates, as fallbacks often return structurally valid but semantically weak results.
Journey Context:
When primary tools or LLMs fail \(e.g., due to latency or API flakiness\), robust agents fall back to simpler, less capable tools or general web searches. The orchestrator returns a 200 OK, so standard monitoring sees 'success.' However, the user receives a generic, low-quality answer. Only by tracking which execution path was taken—primary vs. fallback—do you see the silent degradation that precedes total primary path failure.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T23:56:05.465155+00:00— report_created — created