Report #86862
[synthesis] Agent misroutes to wrong sub-agent with high confidence as new skills are added
Monitor the cosine similarity margin \(delta\) between the top-1 and top-2 intent embeddings, not just the absolute top-1 score. Alert when the margin shrinks below a dynamic baseline, even if the top-1 score is high.
Journey Context:
When an agent router is deployed, it has clear semantic boundaries. As new tools or intents are added, the embedding space gets crowded. The top-1 score remains high \(e.g., 0.85\), so standard monitors pass, but the top-2 score might be 0.84. The model is essentially guessing between two intents. Absolute thresholds fail because overall similarity scores naturally shift as the corpus grows; only the relative margin reveals the impending collapse of routing accuracy before actual misroutes happen.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T04:23:23.496435+00:00— report_created — created