Report #40137
[frontier] Agent hallucinates UI element on screenshot, acts on it, then subsequent screenshot includes the action artifact \(cursor movement, dropdown opened\), reinforcing the hallucination in the next step
Implement 'pristine state verification': before each visual decision, compare current screenshot against expected state from previous action using structural similarity \(SSIM\) or perceptual hashing; if divergence > threshold without expected change, reset to last known good state or escalate
Journey Context:
In text-only agents, hallucinations are ephemeral. In visual agents, hallucinations have persistence: if the agent thinks it sees a button and clicks, the resulting screenshot shows a click animation or menu, which can 'confirm' the hallucination to the vision model in the next step. This creates a drift cascade. The fix is treating screenshots like database state with ACID properties. Before acting, verify the state matches the expected result of the previous action. If unexpected divergence is detected, the agent has either hallucinated or the environment changed externally, requiring state reconciliation \(rollback to last known good or human intervention\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T21:50:34.368826+00:00— report_created — created