Report #71458
[frontier] Agents waste tokens and latency sending full-resolution screenshots for simple text tasks, or miss critical details when over-compressing complex UIs
Implement adaptive quality tiers: use 'low' detail/resolution for text-heavy static pages, 'high' detail only for visually dense UIs \(maps, charts\), and 'auto' based on DOM complexity heuristics \(count of canvas/img elements\)
Journey Context:
OpenAI and Anthropic APIs charge differently for image tokens based on detail/scale. Always sending high-res wastes money; always sending low-res misses buttons. The pattern detects UI complexity via DOM metrics \(canvas count, image count, CSS complexity\) to select appropriate vision detail level.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T02:31:22.983062+00:00— report_created — created