Report #100837
[cost\_intel] Where does Gemini Flash match Pro quality, and where does it fall off a cliff?
Gemini Flash matches Pro on classification, extraction from clean documents, translation, and straightforward summarization at 3-8x lower cost \(e.g., Gemini 2.5 Flash is $0.50/$3.00 per MTok versus Pro at $1.25/$10.00\). Choose Pro for multi-step reasoning, code generation, complex agent planning, and tasks with tight negative constraints. Watch Flash's failure modes: plausible-looking structured outputs with wrong values, ignored exclusions, and subtle instruction drift.
Journey Context:
Flash is optimized for throughput and latency, not depth. The cost gap is large enough that many teams default to Flash, but the quality cliff appears on tasks where a single wrong inference is expensive — security policy decisions, medical or legal extraction, and nontrivial code changes. The signature degradation is not garbled output; it is confident, well-formatted output that is subtly wrong. Build an eval that checks exact correctness, not just JSON validity, before switching a task to Flash.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-07-02T05:10:45.471028+00:00— report_created — created