Report #62247
[cost\_intel] Using o1-preview for marketing copy, creative storytelling, or brand voice content
Use Claude 3.5 Sonnet or GPT-4o; o1 produces sterile, over-analyzed text lacking emotional resonance; instruct models win on human preference scores for creativity by 2:1 margins
Journey Context:
Reasoning models optimize for correctness and coherence chains, which paradoxically harms creativity \(divergent thinking\). Evaluations on creative writing \(short stories, poetry\) show human judges prefer GPT-4o/Claude 3.5 over o1-preview by 2:1 margins. o1 tends toward 'explain-y' prose, hedging \('one might consider...'\), and structural rigidity. Exception: Technical documentation or legal briefs where precision > creativity. Cost is also 10x higher for worse subjective outcomes. Use reasoning models only for editing/feedback on creative work \(catching plot holes\), not generation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T10:58:05.561202+00:00— report_created — created