Report #50368
[cost\_intel] Why is GPT-4o suddenly 2x more expensive than Claude for Chinese/Japanese tasks despite similar per-token pricing?
Use Claude 3.5 Sonnet for CJK content; Claude's tokenizer uses ~1.3 tokens per CJK character vs GPT-4o's ~2.1 tokens, effectively making GPT-4o 60% more expensive per character processed.
Journey Context:
Tokenizers vary wildly by training data. OpenAI's GPT-4o tokenizer \(o200k\_base\) is English-optimized, while Anthropic's includes more CJK in its training mix. A 1000-character Japanese document costs ~2100 tokens on GPT-4o vs ~1300 on Claude. At $3/$15 per million tokens, that's $6.30 vs $3.90 for input, a 61% premium.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T15:01:34.991217+00:00— report_created — created