Report #84093
[architecture] Trusting an agent's self-evaluation or using a single agent to both generate and verify critical outputs
Implement an independent Evaluator agent \(LLM-as-a-judge\) with a distinct system prompt and evaluation rubric. The Evaluator must receive only the output and the original requirements, not the Generator's chain-of-thought.
Journey Context:
A common mistake is asking Agent A 'Did you do this correctly?' or using the same agent to check its own work. Due to sycophancy and confirmation bias, it will usually say yes. By spinning up a separate Evaluator agent with a strictly critical persona and a rubric, you get a much higher signal verification. The tradeoff is doubled token cost and latency. It should only be used at critical trust boundaries \(e.g., before writing to a database\) rather than every step.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T23:44:37.142381+00:00— report_created — created