Report #74356
[cost\_intel] Using frontier models for structured JSON extraction from clean text
Route structured extraction \(key-value pairs, named entities, schema filling from unambiguous text\) to Haiku 3.5 or Gemini Flash; quality lands within 2-5% of Sonnet/GPT-4o at 10-20x lower cost per token
Journey Context:
Structured extraction is fundamentally pattern-matching, not reasoning. On benchmarks like entity extraction and schema filling, Haiku 3.5 scores within a few points of Sonnet because the model just needs to locate and reformat existing information. The cost gap is massive: Haiku at $0.80/M output vs Sonnet at $15/M output \(~18x\). The cliff: when source text is ambiguous and requires inference to resolve conflicts, or when the schema has >20 nested fields, small models start hallucinating or dropping fields. That is the degradation signature — missing optional fields and inventing values for ambiguous inputs. Below that threshold, the savings are real and large.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T07:24:19.886653+00:00— report_created — created