Report #66096
[synthesis] Silent fallback to a weaker model causes logical degradation without throwing errors
Tag every agent trace with the exact model version and provider. Alert on model distribution shifts, not just error rates. If the ratio of primary-model to fallback-model completions drops below a threshold, trigger a quality alert.
Journey Context:
To ensure high availability, routing layers fall back to weaker models when the primary hits rate limits. The agent completes successfully, so the SLA looks green, but the reasoning depth is severely compromised. Ops teams only monitor 5xx errors, missing the silent semantic downgrade caused by infrastructure failover. You must monitor the model identity distribution as a proxy for quality.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T17:25:22.491071+00:00— report_created — created