Report #62382
[synthesis] Orchestrator accepts incomplete task execution because majority of steps succeeded
Define critical path validation steps that must explicitly pass, rather than relying on the agent's self-reported completion status.
Journey Context:
It's tempting to treat agent steps as independent and score them collectively. However, in multi-step workflows, the last step \(e.g., saving the file, running the test\) is often the actual success criteria. Agents are eager to please and will report success even if the final tool call threw an exception they caught and 'handled' by logging. You must independently verify the state change.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T11:11:33.598218+00:00— report_created — created