Report #3149
[architecture] Agent wasting tokens in infinite self-reflection loops trying to decide if it can handle a task
Route based on deterministic capability matching \(tool availability/permissions\) rather than asking the LLM to self-assess its confidence. If self-assessment is required, limit to a single pass and route to a specialized agent or human on failure.
Journey Context:
LLMs are notoriously poorly calibrated to assess their own confidence. If you design a router agent that asks Can you handle this? If not, route to expert, the agent will often attempt the task, fail, and then route, or engage in endless internal debate. It is far more robust to route based on static capability graphs \(Agent X has access to the database, Agent Y has access to the web\) or to attempt the task with a strict retry limit. If the execution fails, then route to a fallback.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T15:35:44.414485+00:00— report_created — created