Report #92754
[cost\_intel] Haiku 3.5 vs Sonnet 3.5 quality gap on structured JSON extraction from messy documents
Use Haiku 3.5 with chain-of-verification self-correction prompt to match Sonnet 3.5 F1 within 2% at 10x lower cost; upgrade to Sonnet only when source text contains ambiguous enums requiring world knowledge.
Journey Context:
Teams default to Sonnet for extraction assuming Haiku hallucinates too much. However, Haiku 3.5's instruction-following is sufficient for schema compliance if you add a second pass where the model verifies its own output against the source text. Sonnet only pulls ahead on implicit reasoning tasks \(e.g., inferring industry from subtle phrasing\). The cost difference is an order of magnitude, so the default should be Haiku\+verification, reserving Sonnet for the 15% of documents with high ambiguity.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T14:16:32.998357+00:00— report_created — created