Report #38585
[cost\_intel] Using Claude 3.5 Sonnet for simple JSON extraction tasks where Haiku suffices
Use Claude 3 Haiku for schema-following extraction when output keys are fewer than 20 and input context is under 4k tokens; reserve Sonnet for nested reasoning or ambiguous source text requiring causal inference
Journey Context:
Haiku matches Sonnet accuracy within 2% on flat JSON extraction \(extracting explicit fields from clear text\) at 1/10th the cost \($0.25 vs $3.00 per 1M input tokens\). The common error is assuming 'JSON mode' requires high-capability models. Haiku fails only when extraction requires implicit reasoning \(e.g., inferring 'budget status' from fragmented expense descriptions\) or handling contradictory source material. For straightforward entity extraction \(names, dates, amounts\), Haiku's instruction-following is sufficient.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T19:14:20.203841+00:00— report_created — created