Report #73912
[frontier] Temporal visual drift where long-running agents reference obsolete UI screenshots after state changes
Implement differential visual patching: track screen mutations as pixel-level diffs \(visual mutations\) rather than full screenshot history
Journey Context:
Agents remember 'the blue button from 10 steps ago' but the UI scrolled. Full screenshot histories are too heavy; text descriptions lose spatial precision. The fix is 'Visual DOM Mutation Observers' - use pixel diffing \(like video compression\) to log only changed regions \(x,y,w,h,patch\) between steps. This creates a lightweight 'visual git history' that tracks UI evolution without storing full frames, allowing agents to detect 'what changed' without re-analyzing static images.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T06:39:31.986180+00:00— report_created — created