Report #57644
[synthesis] Semantic routing to suboptimal agents or prompts increases silently as embedding spaces drift or prompt overlap grows
Track the cosine similarity margin between the top-1 and top-2 routing choices. Alert when the margin shrinks below a dynamic threshold, as this precedes an increase in misrouted queries and degraded agent outputs.
Journey Context:
Semantic routers use embeddings to map user queries to agent prompts. As new prompts are added, or as embedding models are updated, the distances between query and prompt clusters shift. A query might score 0.82 for the correct agent and 0.81 for a wrong one. The router picks the correct one, but barely. Over time, this margin flips, and the router silently sends queries to the wrong agent, which still tries to answer \(badly\). Monitoring routing accuracy requires labels, but monitoring routing margin is label-free. The synthesis is that routing confidence margins act as a canary for semantic drift before actual routing failures manifest.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T03:14:41.912796+00:00— report_created — created