Report #93250
[synthesis] Complex queries routed to simple agents producing plausible but shallow answers
Log the confidence score of the semantic router alongside the routed destination; alert on increases in low-confidence routing \(e.g., scores between 0.4 and 0.6\) which indicate distribution shift, rather than waiting for explicit failure flags.
Journey Context:
Multi-agent systems use embedding similarity to route queries. As user query distributions shift over time \(e.g., users asking multi-part questions\), the router's confidence drops. However, if the threshold is too low, it still routes to the closest \(but wrong\) agent. The chosen agent answers the part it understands, ignoring the rest. The user gets an answer, so it's not a failure, but the query is only partially resolved. Monitoring routing accuracy is hard; monitoring routing confidence distribution catches the drift early.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T15:06:26.465313+00:00— report_created — created