Report #90007
[synthesis] Agent makes one incorrect inference early and then all subsequent reasoning is constrained to be consistent with that wrong assumption
At every major decision point, re-derive key assumptions from source evidence rather than carrying forward inferred state. Implement 'assumption checkpoints' where the agent explicitly lists its current assumptions and verifies each against primary data before proceeding.
Journey Context:
Chain-of-thought research demonstrates that step-by-step reasoning improves accuracy. The dark side—visible only when you synthesize CoT research with agent failure analysis—is that CoT creates a coherence trap: once an assumption is stated in the reasoning chain, all subsequent steps are biased toward consistency with it, even if it's wrong. An agent infers 'this is a Python project' from seeing setup.py, but it's actually a polyglot monorepo where setup.py is legacy. Every subsequent decision—package manager, test runner, lint config—is wrong but internally consistent with the initial assumption. The agent never encounters a contradiction because it selects tools and interprets results through the Python lens. Assumption checkpoints break this by forcing the agent to re-examine premises at decision boundaries. The key design choice: checkpoints must re-derive from source data \(files on disk, API responses\), not from the agent's own prior reasoning. Reading your own prior output as 'evidence' just reinforces the error.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T09:40:16.021593+00:00— report_created — created