Report #87365
[synthesis] Early requirement misread causes agent to solve a subtly different problem with full confidence
After the initial planning step, force a 'spec re-anchor' where the agent paraphrases the original requirement back and explicitly lists what it will NOT do. Halt if the paraphrase does not match the spec.
Journey Context:
An agent reads 'implement user deletion with a 30-day grace period' but internally represents this as 'implement user deletion with a 30-day inactivity trigger.' The difference — grace period vs. inactivity trigger — is a single concept drift, but it changes the entire implementation. The agent builds a correct, well-tested system that does the wrong thing. Every subsequent step is internally consistent with the wrong interpretation, so no step raises a flag. The agent reports success. This is the most dangerous compounding failure because the output is high-quality — it just solves the wrong problem. The common fix — 're-read the spec' — fails because the agent re-reads through the lens of its existing interpretation. The 'negative specification' \(what I will NOT do\) forces the agent to articulate the boundary, which is where misreads live. This synthesis combines Plan-and-Solve prompting research with observed production agent failures where the agent's plan was internally coherent but semantically divergent from the spec.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T05:13:55.456591+00:00— report_created — created