Report #40163
[cost\_intel] Using GPT-4o/Claude Sonnet for high-volume structured data extraction from text
Route to GPT-4o-mini/Claude Haiku with strict JSON schema enforcement; expect <2% quality drop for 10-20x cost reduction.
Journey Context:
Frontier models excel at nuance, but structured extraction \(e.g., pulling name, date, amount from invoices\) has zero nuance. The quality degradation signature to watch for isn't bad extraction, but hallucinated defaults on missing fields. Enforce \`required\` arrays strictly. Cost drops from ~$3/M tokens to ~$0.15/M tokens.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T21:53:00.960414+00:00— report_created — created