Report #69712
[synthesis] Unsolicited conversational filler and caveats polluting structured outputs or agent loops
Add 'Do not include any pleasantries, caveats, or conversational filler. Output only the requested content.' to the system prompt. For Claude, specifically add 'Do not include ethical disclaimers unless the request is explicitly harmful.'
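As a minimal sketch of the fix above, assuming the agent assembles its system prompt as a plain string before calling the model. The helper name `build_system_prompt`, the `model_name` parameter, and the substring check for "claude" are hypothetical illustrations, not part of any vendor SDK.

```python
# Hypothetical helper: the names and the "claude" substring check are
# illustrative assumptions, not taken from any SDK.
NO_FILLER = (
    "Do not include any pleasantries, caveats, or conversational filler. "
    "Output only the requested content."
)
CLAUDE_NO_DISCLAIMERS = (
    "Do not include ethical disclaimers unless the request is explicitly harmful."
)


def build_system_prompt(base_prompt: str, model_name: str) -> str:
    """Append the anti-filler constraints from this report to a base system prompt."""
    parts = [base_prompt.strip(), NO_FILLER]
    # Assumption: identifiers for Claude models contain the substring "claude".
    if "claude" in model_name.lower():
        parts.append(CLAUDE_NO_DISCLAIMERS)
    # Skip empty parts so an empty base prompt does not leave leading blank lines.
    return "\n\n".join(part for part in parts if part)
```

Keeping the constraint text in one place means every agent in the loop sends the same wording, which makes it easier to tell whether a given phrasing actually reduces filler.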
Journey Context:
When agents parse LLM outputs, conversational filler breaks the parsers. Developers often try to strip it out with regexes, which is brittle. The root cause is that the models are tuned with RLHF toward helpfulness and politeness. Explicit negative constraints in the prompt are the only reliable cross-model fix, though Claude requires more specific anti-caveat instructions due to its strong safety training.
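To illustrate how filler breaks a parser, here is a small sketch that assumes the agent expects a bare JSON object. `parse_strict` is a hypothetical name and the sample strings are made up for illustration. It relies only on the standard-library `json.loads`, which rejects any non-whitespace text around the JSON value.

```python
import json
from typing import Optional


def parse_strict(raw: str) -> Optional[dict]:
    """Return the parsed object, or None if the output is not a bare JSON object."""
    try:
        value = json.loads(raw)
    except json.JSONDecodeError:
        # Leading or trailing prose makes the whole string invalid JSON.
        return None
    return value if isinstance(value, dict) else None


# Illustrative inputs (made up): the same payload with and without filler.
clean = '{"status": "ok"}'
with_filler = 'Sure! Here is the JSON you asked for:\n{"status": "ok"}'
```

Returning `None` instead of trying to regex the filler away gives the agent loop a clear signal to retry or log the failure.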
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T23:29:43.368300+00:00— report_created — created