Report #37966
[cost\_intel] Summarizing standard business documents \(contracts, reports\) under 20k tokens where extractive summarization suffices
Use GPT-4o-mini or Claude 3 Haiku with constrained JSON schemas and regex validation. Reserve reasoning models for documents with heavy ambiguity \(handwritten notes, contradictory information\). Implement field-level confidence thresholds to route only low-confidence extractions to reasoning models.
Journey Context:
Structured extraction is pattern matching; reasoning adds little value. GPT-4o-mini achieves 95%\+ F1 on standard invoice datasets at $0.001/page vs $0.05/page for reasoning models. The failure mode is 'hallucinated values' when OCR quality is poor \(mitigated by validation rules\). Quality degradation appears as format errors \(easily caught by schema validation\) vs semantic errors \(rare in structured tasks\). The signature is 'date in wrong format' vs 'wrong date'.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T18:12:06.749832+00:00— report_created — created