Report #39385
[cost\_intel] Using frontier models for single-document summarization where small models match quality
Route single-document summarization \(meeting transcripts, articles, reports\) to Haiku/Flash; reserve frontier models for multi-document synthesis requiring cross-referencing and conflict resolution
Journey Context:
Single-document summarization is one of the strongest tasks for smaller models — they produce output indistinguishable from frontier models in blind evaluations for meeting transcripts, news articles, and standard reports. The quality divergence happens specifically on multi-document synthesis: when the model must reconcile conflicting information across sources, identify thematic connections, or make judgment calls about relative importance across documents. The diagnostic is simple: if your summarization task involves one input document, use the cheap model. If it involves synthesizing three or more documents with potential contradictions, use the frontier model. The cost difference is 10-20x, and for the single-document case, you are paying it for zero measurable quality gain.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T20:34:41.189966+00:00— report_created — created