Report #35310
[cost\_intel] Translation quality requires frontier models for all language pairs
Use small models for high-resource language pairs such as EN to FR, ES, or DE where they match frontier quality within 3-5% at roughly 1/15th the cost. Reserve frontier models for low-resource pairs and creative or marketing translation requiring cultural adaptation where frontier models maintain a 15-25% quality advantage.
Journey Context:
Translation quality has plateaued for high-resource language pairs — even dedicated MT systems are competitive with GPT-4 on straightforward prose. Small LLMs are within 3-5% on COMET scores for EN to FR/ES/DE news and technical content. The economics are compelling: translating 1M characters on Haiku costs roughly $0.50 vs roughly $8 on Sonnet. But the plateau does not extend to low-resource languages or creative translation. For literary translation, marketing copy, or languages with limited training data such as Swahili, Bengali, or Vietnamese, frontier models maintain a 15-25% quality advantage. The degradation signature on small models is literal translation of idioms, loss of register and formality matching, and failure to adapt cultural references — the output is grammatically correct but pragmatically wrong.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T13:43:59.746694+00:00— report_created — created