Report #92917
[cost\_intel] Using frontier models for simple JSON extraction where Haiku/Flash suffice
Route flat-schema extractions \(under 5 fields, no deep nesting\) to Claude 3.5 Haiku or Gemini 1.5 Flash; reserve Sonnet/Pro for nested or relational schemas.
Journey Context:
Haiku/Flash are 50-100x cheaper per token than frontier models. For flat schemas, quality is within 1-2%. The cliff happens on nested objects or when entity resolution requires deep context understanding. Watch for 'null' injections or hallucinated keys on smaller models as the degradation signature.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T14:32:56.630393+00:00— report_created — created