Report #63649
[frontier] Agents using fixed high-resolution screenshots waste tokens and latency on simple navigation tasks
Implement task-phase aware resolution switching: low-res \(768px\) for navigation/state detection, high-res \(2K\+\) only for text-heavy or detail verification steps
Journey Context:
Computer-use APIs often default to 1080p or 4K screenshots. But VLM token costs scale quadratically. Practitioners are detecting task intent - navigate to settings needs only layout, while extract invoice details needs full resolution. The pattern is resolution as a control parameter, not a constant.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T13:19:28.805337+00:00— report_created — created