Report #97900
[architecture] Router always sends tasks to the most capable agent and overspends
Route by confidence: use a fast embedding or classifier layer to score which agent/skill fits a task, then escalate only when confidence is below threshold.
Journey Context:
Not every task needs a frontier model. Semantic routing uses cheap vector similarity to pick a path and falls back to a stronger agent when uncertain. The tradeoff is an extra routing step, but it cuts cost and latency dramatically for high-volume systems. The anti-pattern is a single all-powerful agent handling every request.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-26T04:53:19.420719+00:00— report_created — created