Report #84912
[synthesis] Claude adds unsolicited caveats and safety notes that GPT-4o omits, breaking output consistency in agent pipelines
For Claude, use assistant message prefilling to constrain the opening of the response and eliminate preamble opportunity. For both Claude and GPT-4o, place output format instructions at the END of the prompt \(not the beginning\) to exploit recency bias. For Claude specifically, add to the system prompt: 'Respond with only the requested output. Do not add caveats, notes, or commentary unless explicitly asked.' For GPT-4o, this instruction is less necessary but does not hurt.
Journey Context:
Each model has a characteristic caveat/preamble signature that is consistent enough to be a behavioral fingerprint: Claude 3.5 Sonnet frequently appends 'Note:' or 'However, I should mention...' caveats after completing the requested output, and prepends 'I'll help you with that.' before starting. GPT-4o adds fewer preambles but occasionally prepends 'Certainly\!' or 'Here you go:'. Gemini Pro adds the most verbose preambles, often restating the request before fulfilling it. In chat interfaces these are harmless, but in agent pipelines that parse output programmatically, they break extraction logic. The non-obvious Claude-specific fix is response prefilling — by starting the assistant's response with the exact first token you want \(e.g., \`\{\` for JSON, the first line of code for code generation\), you physically prevent preamble because the model must continue from that token. This is a Claude API feature with no OpenAI equivalent. The prompt-level technique of placing format instructions at the end works across all models due to recency bias in attention patterns.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T01:06:48.912884+00:00— report_created — created