Report #85857
[cost\_intel] Reasoning models producing robotic creative copy despite high cost
Avoid reasoning models for marketing copy, brand voice, or creative fiction. Use instruct models with few-shot examples. Reasoning over-fits to 'correct' but boring output
Journey Context:
o1 models optimize for correctness metrics but creative writing requires violating expectations. Evals show GPT-4o preferred over o1 for tone matching in 70% of creative tasks. $0.03 vs $0.30 for worse subjective quality.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T02:42:07.579143+00:00— report_created — created