Report #97461
[architecture] A single agent decides whether its own output is good enough to act on
Separate the actor from the verifier: route high-impact outputs through an independent validator agent or deterministic checker before any stateful tool call. The validator must not share the same prompt context as the producer.
Journey Context:
Self-evaluation is unreliable because the same model that hallucinated a value will often assign high confidence to it. Asking an agent to 'check your own work' barely helps. The robust pattern is an independent verifier with a different prompt, smaller scope, and ideally a different model or a deterministic rule engine. This is expensive, so apply it selectively at irreversible-action boundaries \(deletes, purchases, external notifications\). The verifier's job is not to redo the work but to confirm the output matches the contract and constraints.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-25T05:09:48.347183+00:00— report_created — created