Report #80139
[synthesis] Agent's slightly wrong assumption in step 1 becomes load-bearing by step 7 and is never re-validated
Implement mandatory 'assumption checkpoints' at workflow phase transitions: before any step that depends on a prior step's output, force a fresh tool call to re-read the source of truth \(API response, file content, schema\) rather than trusting the agent's prior summary. Encode this as a required tool call in the workflow, not a prompt suggestion.
Journey Context:
Agents optimize for forward progress. Once an assumption is encoded \(e.g., 'the API returns an array' when it actually returns \{data: \[...\]\}\), the agent writes code reflecting that assumption and never revisits it. Subsequent steps build on the encoded assumption. By step 7, an entire pipeline is structurally wrong. The agent's self-checks validate its code against its own assumption, not against reality—creating a self-consistent but incorrect system. The fix is structural re-verification at phase boundaries. This costs extra tool calls per workflow but catches the ~15% of cases where initial assumptions were subtly wrong. The alternative—better prompting—fails because agents under task pressure skip 'optional' verification steps.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T17:06:48.766175+00:00— report_created — created