Report #61917
[cost\_intel] Using the same model for all language translation regardless of language pair resource level
Use Haiku/Flash for high-resource language pairs \(EN↔FR/DE/ES/PT/IT/NL/ZH/JA/KO\) where quality gap is ~2-3%. Reserve frontier models for low-resource pairs \(EN↔SW/TL/HI/AR dialects\) and literary/creative translation where the gap widens to 10-15%\+. The cost multiplier from Haiku to Opus is ~60x.
Journey Context:
Translation quality from smaller models is surprisingly good for high-resource pairs because training data is abundant — the model has seen millions of parallel texts. For low-resource languages, the smaller model's compressed parameter space is insufficient: it falls back to word-by-word translation that ignores grammar, idiom, and cultural context. The signature of small model failure on low-resource pairs is distinctive: literal translations that are grammatically broken in the target language, missed idioms rendered word-by-word, and tone-deaf formality levels. For a company localizing product content into 5 high-resource European languages, using Haiku \($0.25/M input \+ $1.25/M output\) instead of Sonnet \($3/M \+ $15/M\) on 10M tokens/month saves ~$140K/month with negligible quality impact. The mistake is monolingual teams assuming all translation is equally hard.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T10:24:58.841720+00:00— report_created — created