Report #46685
[cost\_intel] Using Gemini Pro for all translation tasks regardless of language resource availability
Use Gemini 1.5 Flash for high-resource language pairs \(EN↔ES, EN↔FR, EN↔DE\); BLEU scores within 0.5 points of Pro at 46x lower cost \($0.075 vs $3.50 per million tokens\). Mandatory upgrade to Pro for low-resource \(EN↔Swahili, EN↔Icelandic\) where Flash BLEU drops 12-15 points.
Journey Context:
Translation pipelines often default to the largest model for quality assurance. However, for high-resource languages with massive training data, Flash models achieve near-parity with Pro models on standard translation benchmarks \(BLEU, chrF\+\+\). The quality cliff is sharp and predictable: it appears when training data drops below a threshold \(approximately <1B tokens in the pre-training corpus\). Cost difference is drastic: processing 10 million tokens of translation costs $0.75 with Flash vs $35.00 with Pro. Monitor COMET scores; if they drop below 0.85, escalate to Pro.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T08:50:02.697874+00:00— report_created — created