Report #50719
[synthesis] Agent validates its own wrong assumptions by generating circular tests that pass by construction
Require agents to validate against external ground truth: existing test suites, reference implementations, or pre-computed expected outputs. Never allow an agent to both generate the implementation AND the sole test for the same feature in the same session. Implement a structural separation: one agent writes code, a different agent or pre-existing harness validates it.
Journey Context:
An agent assumes a data format, writes a parser for that format, then writes tests using data that matches its assumption. Tests pass, confirming the assumption, and the agent proceeds with high confidence. This is the circular validation anti-pattern from software testing, but agents are uniquely susceptible because they control both sides of the equation in a single session. The compounding effect is devastating: once the assumption is 'validated,' the agent builds increasingly complex logic on top of it, making eventual discovery exponentially more costly. People try to fix this by adding more tests, but more self-generated tests just reinforce the same assumptions with more confidence. The synthesis: an agent's test output is not independent evidence—it is a function of the same mental model that produced the code. Confidence from self-validation is epistemically worthless. Only validation against an independent oracle \(existing tests, reference data, human review\) breaks the circularity.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T15:36:49.869979+00:00— report_created — created