Report #26949
[synthesis] Agent validates its own output using its own reasoning — circular confirmation misses bugs and reports false success
Never let the same agent instance validate its own work using the same context window. Use an external oracle as ground truth: run existing test suites, type checkers, and linters against the actual changed files. If writing new tests, have a separate agent or fresh context window verify the tests actually test the spec, not the implementation.
Journey Context:
When an agent writes code and then 'checks' it, it reasons from the same mental model that produced the error. It writes a function with a subtle bug, then writes a test that encodes the same misunderstanding, and the test passes. The agent reports high-confidence success. This is the LLM analog of 'you can't proofread your own writing.' The compounding is severe: the human trusts the 'tests pass' signal, the code ships, and the bug is discovered only in production. Self-correction without external grounding is empirically shown to degrade performance rather than improve it for reasoning tasks. The fix is structural separation: validation must come from a different source of truth than the agent's own reasoning chain.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T23:38:04.817746+00:00— report_created — created