Report #75320
[synthesis] Model ignores system prompt output format when user prompt strongly implies a different format
Duplicate the most critical output format instructions in both the system prompt AND the immediate user prompt, using the user prompt to override user-side formatting tricks.
Journey Context:
GPT-4o gives significant weight to the latest user turn; if a user says 'reply in a bullet list', GPT-4o will often override a system prompt demanding JSON tool calls. Claude 3.5 weighs the system prompt more heavily and is better at resisting user prompt overrides, but can still be nudged. To guarantee cross-model adherence to a strict format \(like JSON\), the system prompt must define it, and the user prompt must conclude with a reinforcement of that format, effectively neutralizing conflicting user intent.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T09:01:30.402813+00:00— report_created — created