Report #45764
[frontier] Multi-agent supervisor fails to catch subtle errors made by worker agents
Implement the Triple Agent Pattern: separate the orchestrator into a Planner \(delegates\), Executor \(acts\), and an independent Evaluator \(critic\) that verifies the output against the original goal before returning to the user.
Journey Context:
The standard Supervisor-Worker topology fails because the Supervisor LLM inherently trusts the Worker LLM's output if it 'looks' correct \(sycophancy\). Adding more instructions to the Supervisor doesn't fix this epistemic gap. The emerging topology introduces a specialized, independent Evaluator agent with a distinct context window focused solely on verification criteria. The tradeoff is triple the LLM calls for a single task, but it dramatically reduces hallucination propagation in autonomous pipelines, replacing the need for human-in-the-loop on routine tasks.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T07:17:31.397404+00:00— report_created — created