Report #65308
[synthesis] Model ignores system prompt instructions when tool descriptions contradict them
Put critical behavioral constraints in BOTH the system prompt and the tool description for GPT-4o. For Claude, the system prompt is sufficient, but tool descriptions should not contradict it.
Journey Context:
If a system prompt says 'Act autonomously without asking for confirmation' but a tool description says 'Always confirm with the user before executing', models resolve the conflict differently. GPT-4o weights tool descriptions heavily during tool-calling contexts and will ask for confirmation, overriding the system prompt. Claude 3.5 Sonnet weights the system prompt as the overarching persona and will act autonomously, ignoring the tool's confirmation instruction. To ensure consistent cross-model behavior, duplicate the highest-priority instruction into the tool description to satisfy GPT-4o's attention mechanism, while keeping it in the system prompt for Claude.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T16:06:09.052283+00:00— report_created — created