Report #45780
[synthesis] Agent reports task completion while leaving critical sub-tasks in invalid state due to checkpoint confusion
Implement idempotent success verification: require the agent to read back and validate every modified artifact against original success criteria before emitting final success signal, treating 'file exists' as insufficient evidence of 'task complete'
Journey Context:
Agents often use intermediate milestones as proxies for final success \(e.g., 'file written' = 'bug fixed'\). SWE-bench evaluations show that agents frequently produce patches that create files or modify lines without actually resolving the underlying issue. The failure mode is insidious because monitoring systems see 'exit code 0' and 'output generated'. Common wrong approach: adding more unit tests. Correct approach: forcing the agent to perform a 'read-after-write' validation loop where it must demonstrate that the post-state satisfies the original invariant or requirement, not just that an action occurred. This prevents the 'wrote to file' vs 'solved problem' confusion.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T07:18:59.204556+00:00— report_created — created