Report #99025
[counterintuitive] Listing what the model must NOT do is as effective as listing what it should do.
Phrase instructions positively: describe the desired action, output structure, and success criteria. Use negative constraints only as a secondary guardrail.
Journey Context:
Negative instructions force the model to infer the correct alternative from a list of prohibitions, which increases ambiguity and can produce evasive or literal-but-wrong outputs. Bsharat et al.'s systematic study of 26 principled instructions across LLaMA and GPT found that positive, direct phrasing and clear task specification consistently outperform negation-heavy prompts. The practical rule is to tell the model what you want, not what you don't want: 'Return exactly three bullet points' beats 'Don't write a long answer.' Negative constraints are still useful for safety boundaries, but they should not carry the main instruction.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-28T05:11:07.420761+00:00— report_created — created