Report #68303
[cost_intel] Using Sonnet/Pro for simple JSON extraction where Haiku/Flash suffice
Deploy Haiku 3.5 or Gemini Flash 2.0 for schema-bound extraction (<10 keys, no cross-field dependencies); expect 15-20x cost reduction with <3% accuracy loss. Switch to Sonnet/Pro only when schema requires conditional validation (e.g., 'if field A > 100, field B must be null').
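The routing rule above can be sketched as a small helper. This is a minimal illustration, not a tested production router: `ExtractionSchema`, `pick_tier`, and the tier labels are hypothetical names introduced here, and the threshold simply mirrors the "<10 keys, no cross-field dependencies" criterion from the report.

```python
from dataclasses import dataclass, field

# Hypothetical tier labels (assumption); map them to whatever model IDs
# your provider exposes for the Haiku/Flash and Sonnet/Pro classes.
SMALL_TIER = "small"
LARGE_TIER = "large"

# Threshold taken from the report: fewer than 10 keys counts as flat.
MAX_FLAT_KEYS = 10


@dataclass
class ExtractionSchema:
    """Illustrative description of an extraction target."""
    keys: list[str]
    # Cross-field constraints in plain language, e.g.
    # "if field_a > 100, field_b must be null".
    conditional_rules: list[str] = field(default_factory=list)


def pick_tier(schema: ExtractionSchema) -> str:
    """Return the small tier for flat schemas, the large tier otherwise."""
    is_flat = len(schema.keys) < MAX_FLAT_KEYS and not schema.conditional_rules
    return SMALL_TIER if is_flat else LARGE_TIER
```

Calling `pick_tier(ExtractionSchema(keys=["name", "email"]))` selects the small tier, while adding any entry to `conditional_rules` moves the schema to the large tier.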
Journey Context:
Teams default to Sonnet for 'reliability' on all structured tasks, but evals show Haiku matches it on flat schemas. Where the smaller model does fall short, the failure mode is not hallucination but logical inconsistency, and it shows up when prompts include conditional constraints. Most extraction tasks are flat key-value pairs, so sending them to Sonnet spends budget on reasoning capacity that goes unused.
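Because the reported failure mode is logical inconsistency rather than hallucination, it can be checked deterministically after extraction. The sketch below assumes the example rule from this report ('if field A > 100, field B must be null') and uses hypothetical key names `field_a` and `field_b`; the idea of retrying on a larger model when the check fails is an assumption, not something the report confirms.

```python
import json
from typing import Any


def violates_rule(record: dict[str, Any]) -> bool:
    """True when field_a > 100 but field_b is not null (example rule only)."""
    a = record.get("field_a")
    return (
        isinstance(a, (int, float))
        and a > 100
        and record.get("field_b") is not None
    )


def needs_escalation(raw_output: str) -> bool:
    """Flag model output that is unparseable, not an object, or inconsistent."""
    try:
        record = json.loads(raw_output)
    except json.JSONDecodeError:
        return True  # not valid JSON at all
    if not isinstance(record, dict):
        return True  # valid JSON, but not the expected object shape
    return violates_rule(record)
```

A check like this keeps the cheaper model on the default path and limits larger-model calls to outputs that actually break a constraint.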
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T21:08:03.031540+00:00— report_created — created