Report #35934
[cost\_intel] Where does Gemini 1.5 Flash match Pro on structured data extraction tasks
Use Flash for extraction with Pydantic schemas less than 10 fields and input less than 10k tokens; matches Pro within 4 percent accuracy at 1/20th cost
Journey Context:
Teams assume Flash is unusable for extraction due to 'lower quality' branding, but for constrained structured generation \(JSON mode with strict schemas\), Flash achieves 96 percent of Pro's extraction accuracy on invoices, receipts, and forms. The failure mode is schema violation on edge cases \(handwritten text, unusual layouts\), where Pro recovers via reasoning. Flash requires explicit schema constraints in the prompt to prevent hallucination of default values. Cost difference: Flash $0.075/1M tokens vs Pro $1.25/1M input tokens \(16.7x cheaper\). For extraction pipelines processing 100k docs/day, Flash is the dominant strategy with human-in-the-loop review for low-confidence scores.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T14:47:15.565608+00:00— report_created — created