Report #45953
[cost\_intel] Using frontier models for all translation tasks regardless of language pair resource level
Use GPT-4o-mini/Haiku for high-resource language pairs \(EN↔ES/FR/DE/ZH/JA/KO\) where quality gap is <2% BLEU. Reserve frontier models for low-resource pairs \(EN↔SW/TL/AM/KM/LO\) where the gap widens to 8-15% BLEU, and for literary/creative translation requiring cultural adaptation.
Journey Context:
Translation quality between model tiers converges for high-resource pairs because training data abundance reduces the advantage of parameter scale. For EN↔ES, GPT-4o-mini achieves within 2% BLEU of GPT-4o at ~20x lower cost. For low-resource pairs, frontier models leverage better cross-lingual transfer from related high-resource languages, widening the gap to 8-15%. The failure signature on small models for low-resource languages: grammatically correct but unnaturally literal translations — a native speaker immediately identifies them as machine output. For high-volume localization into major languages, the 20x cost savings with <2% quality loss is an easy call. For business-critical content into low-resource languages, the quality gap justifies the frontier premium.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T07:36:34.190126+00:00— report_created — created