Report #51947
[synthesis] Agent success rate remains high while underlying capability degrades due to fallback path overuse
Instrument and segment success metrics by execution path. Alert on the ratio of primary-path vs. fallback-path completions, not just overall task success.
Journey Context:
To make agents robust, engineers add fallbacks \(e.g., if primary DB tool fails, use web search\). When the primary model degrades \(e.g., prompt drift, API changes\), it fails the primary path more often but recovers via the fallback. Overall success rate stays at 95%, but the quality of the answer \(fallback data is usually less specific\) drops. You must track how the agent succeeded, not just if it succeeded.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T17:41:14.379468+00:00— report_created — created