Report #83434
[cost\_intel] o1 producing sterile marketing copy lacking emotional resonance
Use 4o/Claude for creative writing, brand voice, and emotional content; o1 creates 'over-optimized' text that scores well on rubrics but fails human vibe check
Journey Context:
Reasoning models optimize for correctness metrics, which correlates with sterility in creative tasks. A/B testing and human evaluation studies show 4o copy outperforms o1 on engagement metrics for emotional appeals \(humor, pathos\) by 15-25%, despite o1 scoring higher on grammatical correctness and logical structure. The failure mode is 'overthinking' - analyzing the prompt into components rather than capturing gestalt emotional impact. For brand voice and creative advertising, the 'vibe' is the success metric, not logical coherence.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T22:37:42.512267+00:00— report_created — created