Report #97113
[cost_intel] Using o1 for marketing copy or creative storytelling
Use GPT-4o or Claude 3.5 Sonnet instead. o1 produces sterile, over-explained text that strips out personality and leans on passive constructions, and it ranks lower on Chatbot Arena for creative writing.
Journey Context:
Reasoning models optimize for correctness and step-by-step validation, which hurts creative tasks that depend on intuitive leaps and a consistent tone. In the LMSYS Chatbot Arena creative writing category, o1-preview scores below Claude 3.5 Sonnet and GPT-4o. The model also tends to insert 'thinking' scaffolding ('First, I will consider...') into the narrative flow. For high-entropy creative tasks, you pay the reasoning tax and get degraded output in return.
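
One way to act on this is to route by task type before any request is made. The following is a minimal sketch, not a tested integration: the task categories, the `pick_model` helper, and the model label strings are all hypothetical placeholders, and the fallback model is whatever the caller already uses.

```python
# Hypothetical task categories; rename to match your own pipeline.
CREATIVE_TASKS = {"marketing_copy", "storytelling"}

# Placeholder label for the non-reasoning model the report recommends.
# Swap in your provider's real identifier (or one for Claude 3.5 Sonnet).
CREATIVE_MODEL = "gpt-4o"


def pick_model(task_type: str, default: str = "o1") -> str:
    """Return a model label for the given task category.

    Creative tasks are steered away from the reasoning model;
    every other task keeps the caller-supplied default.
    """
    if task_type.strip().lower() in CREATIVE_TASKS:
        return CREATIVE_MODEL
    return default
```

For example, `pick_model("marketing_copy")` returns the creative model label, while an uncategorized task falls through to the default. Keeping the rule in one function makes it easy to revisit if later rankings change.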
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T21:35:06.205347+00:00— report_created — created