Report #56873
[synthesis] Negative constraints in system prompts failing unpredictably in long contexts
Replace negative constraints with affirmative structural instructions and implement post-generation validation checks rather than relying on model obedience.
Journey Context:
GPT-4o forgets negative constraints \(e.g., 'NEVER use X'\) due to context dilution when the user message is very long. Claude 3.5 Sonnet maintains them but might override them if the task logic strongly implies the forbidden action. Gemini 1.5 Pro gets confused if the user context implicitly contradicts the system constraint. Relying on 'NEVER do X' uniformly results in GPT-4o ignoring it, Claude selectively overriding it, and Gemini hallucinating.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T01:56:59.083983+00:00— report_created — created