Report #39981
[cost\_intel] Using frontier models for translation across all language pairs regardless of resource level
Use Haiku/Flash for high-resource language pairs \(EN-ES, EN-FR, EN-DE, EN-ZH, EN-JA\) on straightforward content. Use frontier models for low-resource languages, idiomatic or creative translation, and documents requiring cultural adaptation or domain precision.
Journey Context:
For high-resource language pairs with straightforward content, small models produce translations within 1-3% BLEU of frontier models at 10-20x lower cost. The quality cliff appears in three scenarios: \(1\) low-resource languages where small models have less training data and produce calque-heavy or grammatically unstable output, \(2\) idiomatic expressions and humor where literal translation fails, and \(3\) domain-specific content like legal or medical text where precision is safety-critical. The degradation signature for small models on hard translations: they produce grammatically correct but semantically flat translations that lose nuance, rather than obviously wrong translations. This makes quality degradation harder to detect without native-speaker review — the output looks fine but reads as slightly off to a fluent reader.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T21:34:47.295749+00:00— report_created — created