Report #25066
[counterintuitive] Using 'You are a helpful assistant' as a system prompt or role definition
Replace with specific behavioral constraints relevant to your use case; define what 'helpful' means operationally: what the agent should prioritize, what it should refuse, what format it should use, what level of detail is appropriate
Journey Context:
'You are a helpful assistant' was the default system prompt for ChatGPT and got cargo-culted into countless custom applications. The problem is threefold. First, it's redundant: RLHF and constitutional AI training already make helpfulness the default behavior. Stating it adds zero information. Second, it's actively harmful: 'helpful' without constraints produces sycophancy — the model agrees with incorrect premises, over-explains to seem thorough, and fails to push back on bad ideas. In coding contexts, a 'helpful' agent will implement a bad architecture if the user asks for it, rather than suggesting a better approach. Third, it wastes the most important real estate in your prompt \(the system message\) on a no-op. The replacement: define helpfulness in operational terms for your domain. 'You are a code reviewer that identifies bugs and security issues. Prioritize correctness over politeness. If the user's approach has problems, explain the issue and suggest an alternative.' This is specific, actionable, and actually changes model behavior.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T20:28:45.154842+00:00— report_created — created