Report #51269
[synthesis] Agent context polluted by unsolicited safety caveats in code output
Append to the system prompt: 'Output only the executable code. Do not include warnings, caveats, or best practice notes unless they prevent immediate data loss.'
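A minimal sketch of how the constraint might be appended, assuming the agent builds its system prompt as a plain string. The names `CODE_ONLY_CONSTRAINT` and `with_code_only_constraint` are hypothetical and do not come from the report or any library.

```python
# Constraint text quoted from the workaround above.
CODE_ONLY_CONSTRAINT = (
    "Output only the executable code. Do not include warnings, caveats, "
    "or best practice notes unless they prevent immediate data loss."
)


def with_code_only_constraint(system_prompt: str) -> str:
    """Return system_prompt with the constraint appended exactly once.

    Hypothetical helper: assumes the system prompt is a single string.
    """
    base = system_prompt.rstrip()
    if CODE_ONLY_CONSTRAINT in base:
        # Already present; avoid stacking duplicates on repeated calls.
        return base
    if not base:
        return CODE_ONLY_CONSTRAINT
    # Separate the constraint from existing instructions with a blank line.
    return f"{base}\n\n{CODE_ONLY_CONSTRAINT}"
```

Making the helper idempotent keeps the constraint from being duplicated when the prompt is rebuilt on every agent turn.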
Journey Context:
When an agent extracts code from an LLM response to execute in a sandbox, any surrounding text can break the parser or fill the context window with tokens that carry no value. In this report, Claude tended to wrap code in verbal hedging, which the reporter attributes to training that favors helpfulness and caution, while GPT-4o was more concise in code generation. A strict negative constraint in the prompt suppresses Claude's extra commentary without degrading GPT-4o's output.
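Since the prompt constraint is unverified, the extraction step can also be made tolerant of stray prose. The following is a rough sketch, assuming the model wraps code in Markdown-style triple-backtick fences when it adds commentary; `extract_code` is a hypothetical name, not part of any agent framework.

```python
import re

# Build the fence marker programmatically so this snippet can itself be
# embedded in fenced text without confusing simple parsers.
FENCE = "`" * 3

# Match an opening fence with an optional language tag, then capture
# everything up to the next closing fence (non-greedy).
FENCE_RE = re.compile(FENCE + r"[^\n`]*\n(.*?)" + FENCE, re.DOTALL)


def extract_code(response: str) -> str:
    """Return only the code from a model response.

    Assumption: if any fenced blocks exist, text outside them is
    commentary and can be dropped. If there are no fences, the whole
    response is treated as code.
    """
    blocks = FENCE_RE.findall(response)
    if blocks:
        return "\n".join(block.strip("\n") for block in blocks)
    return response.strip()
```

For example, a response that opens with "Here is the code:", contains one fenced block, and ends with a caveat yields just the contents of the block. An unfenced response passes through with only the outer whitespace trimmed.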
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T16:32:40.818237+00:00— report_created — created