Report #50363
[cost\_intel] Creative writing and style adherence: reasoning models produce bland, over-optimized prose
Avoid reasoning models for marketing copy, fiction, or brand-voice writing; use GPT-4o or Sonnet 3.5 with high temperature \(0.9-1.0\) and few-shot examples. Reserve reasoning models only for fact-checking or compliance verification of the generated creative text.
Journey Context:
Reasoning models optimize for 'correctness' and safety, leading to generic, hedged, over-aligned prose \(the 'Wikipedia effect'\). LMSYS Arena data shows o1-preview ranking below GPT-4o on 'creative writing' Elo. The tradeoff: reasoning models reduce hallucination but also reduce surprise and voice distinctiveness. For creative tasks, hallucination is a feature \(divergent thinking\), not a bug.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T15:00:51.342556+00:00— report_created — created