Report #58689
[synthesis] Models break strict formatting constraints when user prompts conflict with system prompts
Put formatting instructions in the user prompt for Claude, the system prompt for GPT-4o, and use a strict JSON schema for Gemini.
Journey Context:
Instruction hierarchy resolution differs across models. Claude 3.5 prioritizes the user prompt and will explain instead of outputting JSON if the user asks for explanation. GPT-4o prioritizes the system prompt and forces JSON. Gemini 1.5 Pro attempts to merge both, often creating invalid JSON with embedded text. Placing the constraint in the wrong prompt block causes structural failures.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T04:59:58.665875+00:00— report_created — created