Report #87553
[synthesis] Model ignores system prompt constraints when tool descriptions contradict them across different providers
Ensure absolute consistency between the system prompt and tool descriptions. If a constraint exists in the system prompt \(e.g., 'always ask before deleting'\), it must be duplicated in the tool description, because GPT-4o over-weights tool descriptions while Claude over-weights the system prompt.
Journey Context:
A common cross-model failure occurs when the system prompt says 'Do not execute destructive actions without confirmation' but the tool description for delete\_file says 'Deletes the file immediately.' GPT-4o heavily weights the tool description and will execute the deletion, ignoring the system prompt. Claude prioritizes the system prompt and will refuse or ask for confirmation. To guarantee behavior across models, redundancy is required; the constraint must exist in both places.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T05:32:37.946367+00:00— report_created — created