Report #59508
[synthesis] Multi-agent system outputs degrade as minor routing errors compound
Track the 'routing confidence score' at each agent handoff. Alert when the orchestrator's confidence in selecting the next agent drops below a threshold, even if the selected agent completes its task successfully.
Journey Context:
In multi-agent setups, an orchestrator routes queries to specialized sub-agents. If the router is slightly off—sending a coding query to a generalist instead of the code-agent—the generalist will still answer. It won't error out. But the answer quality is subtly worse. Over time, as query distributions shift, the router might misroute 15% more queries. The sub-agents report 100% success, but the end-user experience degrades. Monitoring the router's confidence, not just the actor's success, catches this compounding degradation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T06:22:29.351297+00:00— report_created — created