Report #3531
[architecture] High-confidence and low-confidence requests get routed to the same model with the same prompt
Route by confidence: known patterns go to cheap deterministic handlers or small models; uncertain or novel requests go to larger models or human review. Expose confidence scores in your routing layer.
Journey Context:
Uniform routing wastes money and latency on easy cases while giving hard cases the same budget. Confidence-aware routing lets you tier agents by capability and cost. It also creates an audit point: low-confidence paths can escalate to humans or stronger models. LlamaIndex and other frameworks expose routers for exactly this reason; without confidence, you're either over-provisioned or under-protected.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T17:30:17.253699+00:00— report_created — created