Report #68839
[synthesis] Model ignores system prompt constraints when tool descriptions contradict them
Ensure tool descriptions and system prompts are perfectly aligned. If they conflict, Claude will follow the tool description, while GPT-4o will follow the system prompt and silently omit the tool call.
Journey Context:
A common pattern is to put general rules in the system prompt and specific instructions in the tool description. If these conflict \(e.g., system prompt says 'never delete files', but a tool description says 'use this to delete files'\), models diverge drastically. Claude 3.5 Sonnet prioritizes the immediate context \(tool description\) and executes the delete. GPT-4o prioritizes the system prompt and silently skips the tool call, returning a text refusal. This causes unpredictable safety bypasses or silent agent failures.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T22:01:46.635888+00:00— report_created — created