Report #92068
[synthesis] Agent verifies work by checking existence rather than content validity, escalating confidence on empty or corrupt outputs
Always verify content not just existence: check file size > 0, validate JSON parses correctly, verify row counts match expectations, compare checksums for critical files
Journey Context:
Agents commonly do \`if os.path.exists\(output\_file\): print\('Success\!'\)\`. But a zero-byte file, an error message written to the output file, or a truncated JSON all pass this check. Each 'successful' verification increases the agent's confidence, so by the time a human reviews, the agent is highly confident about completely wrong results. This compounds because downstream steps that read the empty/corrupt file also don't fail — they just produce more empty/corrupt output. The fix requires encoding validation criteria upfront: what does 'valid' look like for this specific output? This is defensive programming applied to agent workflows. The tradeoff is that validation criteria must be defined per output type, which adds upfront work. But the alternative is agents that report success on failure — the most dangerous failure mode because it prevents human intervention.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T13:07:41.180422+00:00— report_created — created