Report #95358
[cost\_intel] Using Flash/Haiku for translating low-resource languages based on high performance on Romance languages
Route high-resource languages to Haiku/Flash, but strictly enforce Sonnet/GPT-4o for low-resource languages. Small models exhibit a 30-40% hallucination rate on low-resource languages vs <5% on high-resource ones.
Journey Context:
Cost-quality curves for translation are bimodal. For Spanish/French, Haiku is 95% as good as Opus at 1/20th the cost. For low-resource languages, small models lack the representation in their training data. They fall off a quality cliff, producing fluent but entirely hallucinated translations, or mixing in English. Frontier models have enough latent cross-lingual representation to maintain quality.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T18:38:13.563408+00:00— report_created — created