Report #98078
[cost\_intel] The cheapest per-token vision model is picked assuming it is always cheapest per image
Compare total image-token cost, not just per-token rate. GPT-4o-mini historically billed low-res images at 2833 tokens versus 85 for GPT-4o, so a single low-res image could cost more on mini. Check the model-specific base \+ tile or patch \* multiplier formula before routing.
Journey Context:
Providers mix two accounting systems: base/tile \(older models\) and patch/multiplier \(newer mini/nano\). The headline per-million-token price is misleading when the token multiplier differs by 30x. This matters for PDF-to-image, screenshot, and multimodal RAG pipelines. Run the formula for your resolution and model, then choose on cost-per-image, not cost-per-token.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-26T05:11:33.383931+00:00— report_created — created