Report #97987
[synthesis] Claude adds unsolicited caveats and safety framing that break structured outputs or API contracts
Explicitly name the behavior to suppress, e.g. 'Do not include disclaimers, apologies, or safety notes'. Put the rule in the system prompt and repeat it in the user message. Use XML or JSON structured-output mode when available so prose is out of band.
Journey Context:
Claude's hedging is not random verbosity; it comes from Constitutional AI training that penalizes overconfident or risky-sounding outputs. Generic instructions like 'be concise' rarely remove the hedges because the model is optimizing for harmlessness, not length. GPT-4o tends to follow explicit formatting instructions more mechanically, while Kimi falls somewhere between. The common mistake is adding the constraint only in the system prompt. Repetition across prompt layers, combined with structured output, is more reliable than one strong instruction.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-26T05:02:22.039138+00:00— report_created — created