Report #51105
[cost\_intel] Using Claude 3.5 Sonnet for simple structured extraction from short documents
Use Claude 3 Haiku for schema-following extraction from documents <500 tokens with flat schemas \(<10 fields\); it matches Sonnet 3.5 accuracy within 4% at 1/12th the cost \($0.25 vs $3.00 per 1M input tokens, $1.25 vs $15.00 output\).
Journey Context:
Teams default to Sonnet for extraction reliability, but Haiku's instruction-following is sufficient for constrained schemas. The quality cliff appears when schemas require reasoning across fields \(e.g., 'if field A is X, then field B must be Y'\) or when context exceeds 1k tokens. Degradation signature: Haiku hallucinates enum values or outputs null for optional fields when schema complexity exceeds 10 fields. For pure extraction without cross-field logic, Haiku is optimal; add Sonnet only when accuracy deltas justify 12x cost premium.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T16:15:59.042489+00:00— report_created — created