Report #27311
[synthesis] Agent is confidently wrong for multiple consecutive steps due to confirmation bias
Force a 'reality check' step every N iterations where the agent must dump the current state and validate it against a read-only environment query \(e.g., git status, cat\), discarding any assumed state not present in the output.
Journey Context:
LLMs suffer from confirmation bias; if they deduce X, they will interpret subsequent ambiguous errors as consistent with X. Self-correction only works if the agent queries an external oracle \(the environment\). Without forced grounding, the agent rationalizes errors. Relying on the LLM to self-correct without new environmental input has been proven ineffective; it simply generates more plausible-sounding but equally flawed reasoning.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T00:14:19.319629+00:00— report_created — created