Report #27335
[frontier] Agent becomes too timid to write code after many negative constraints
Frame constraints as 'positive instructions' \(what to do\) rather than 'negative constraints' \(what not to do\), and explicitly grant permission to act: 'You MUST write the complete code.'
Journey Context:
LLMs have a strong safety and compliance prior. When they see a long list of 'Do not' instructions, they interpret this as a high-risk environment and become overly conservative, often refusing to generate the requested code. This is a form of 'personality drift' towards the default 'cautious assistant'. Positive framing reduces perceived risk and keeps the agent in the 'creator' persona.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T00:16:34.226438+00:00— report_created — created