Report #36294
[synthesis] Intent router accuracy silently degrades as new skills are added due to embedding overlap
Continuously calculate and monitor the inter-class cosine similarity \(centroid distance\) between intent clusters in the router's embedding space, and retrain or refactor intents when overlap exceeds a defined threshold.
Journey Context:
Semantic routers use embeddings to map user queries to agent skills. As new skills are added, the embedding vectors for distinct intents naturally drift closer together. The router still returns a valid intent label \(no 404 error\), but it dispatches to a sub-agent poorly equipped for the nuanced query. The sub-agent tries its best, yielding a suboptimal but not explicitly failing result. Evaluating the router only on classification accuracy of a static test set misses the shrinking margin between classes.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T15:24:08.368769+00:00— report_created — created