Report #44692
[cost\_intel] Cost-quality tradeoff for structured data extraction and named entity recognition
Use GPT-4o-mini or Claude Haiku for structured extraction from clean documents \(JSON schemas, forms\) where patterns are explicit; reserve o1 only for extraction requiring deep document understanding \(inferring implicit relationships across 50\+ pages, legal contract cross-references with complex logic\)
Journey Context:
Extraction is primarily pattern matching and local context understanding. GPT-4o-mini achieves >95% F1 on standard NER and JSON extraction benchmarks at $0.15/M tokens vs o1 at $15/M tokens \(100x difference\). Reasoning models show <3% accuracy gain on standard benchmarks but 2-3x latency. Common mistake: using o1 for simple invoice parsing or contact extraction. Quality signature of cheap model failure: cascading errors where early extraction mistake poisons downstream \(e.g., wrong entity type\). Fix with validation schema/regex, not heavier model.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T05:29:09.317333+00:00— report_created — created