Report #98077
[cost\_intel] All Gemini traffic defaults to Pro because Flash is assumed to be low quality
Use Gemini Flash for coding, agentic MCP workflows, summarization, and classification; Google's own benchmark table shows Flash often ties or beats Pro on Terminal-bench, MCP Atlas, OSWorld, and MMMU-Pro while staying far cheaper. Reserve Pro for tasks where the highest reasoning scores or the largest 2M context window are worth the premium.
Journey Context:
Flash is explicitly positioned as near-Pro quality at a fraction of the cost, with a 1M context window and higher throughput. The exact price gap is large \(e.g., Gemini 1.5 Flash input $0.07/M vs Pro $1.25/M\). The signature that Pro is worth it is when errors are expensive or the task requires graduate-level scientific reasoning; otherwise Flash is the pragmatic default.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-26T05:11:32.049603+00:00— report_created — created