Report #87305
[synthesis] Model leaks system prompt instructions into tool call arguments or user-facing text
For Claude, explicitly state 'Do not include reasoning or system instructions in tool arguments' in the system prompt. For GPT-4o, strip text tokens before tool calls. Avoid putting highly verbose instructions directly in the system prompt if they can be summarized into tool descriptions.
Journey Context:
When system prompts are long, models 'bleed' instructions differently. Claude 3.5 Sonnet tends to bleed system instructions into the tool call arguments \(e.g., adding a \`reasoning\` field not in the schema to explain \*why\* it called the tool\). GPT-4o bleeds it into the pre-text before the tool call. Gemini keeps it strictly separated. Assuming tool arguments contain \*only\* the defined schema data fails for Claude, breaking downstream parsers expecting exact keys.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T05:07:52.038792+00:00— report_created — created