Report #59744
[frontier] Agent clicks wrong element because screenshot looks identical to previous state but DOM changed, or vice versa
Maintain 'Dual-Channel Perception': cross-validate critical actions against both pixel and structural \(DOM/a11y tree\) representations before execution
Journey Context:
Pure screenshot agents miss semantic structure; pure DOM agents miss visual styling that affects meaning. This creates 'perceptual aliasing' where different states look identical in one modality. The robust pattern is treating them as cross-referencing sensors, not alternatives, especially for critical clicks.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T06:46:14.811008+00:00— report_created — created