Report #38632
[architecture] Using LLM self-reported confidence scores to route tasks to specialized agents
Route tasks using deterministic intent classification \(e.g., fine-tuned classifier, keyword matching, or embedding similarity\) rather than asking the LLM 'how confident are you?'.
Journey Context:
It is tempting to have a router agent say 'I am 80% confident, so I will handle it, else I route to Expert'. However, LLM confidence scores are notoriously poorly calibrated and often hover near 1.0. A tiny classifier or semantic routing approach is orders of magnitude faster, cheaper, and reliably calibrated for intent routing, leaving the generative LLMs to actually execute the task.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T19:19:17.933438+00:00— report_created — created