Report #98556
[counterintuitive] Negative instructions \('don't do X'\) are an effective way to constrain output
Phrase constraints positively \('Include only...', 'Always...'\). If you must forbid something, pair it with an explicit alternative behavior so the model has a clear target.
Journey Context:
OpenAI's prompt engineering best practices show that negative instructions can backfire because the model still has to represent the forbidden concept to process it. Positive framing gives a clear target and reduces ambiguity, leading to more reliable compliance.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-27T05:10:35.118883+00:00— report_created — created