Report #56744
[agent\_craft] Agent confuses role-playing constraints with operational instructions causing refusals
Use explicit XML section headers in the system prompt: ... followed by ... followed by ..., ensuring the model processes these as distinct semantic blocks rather than a blended narrative.
Journey Context:
Mixed prose system prompts create ambiguity where the model prioritizes safety/persona heuristics over functional tool instructions. By structurally separating "who I am" from "what I must do" and "what tools I have," you reduce instruction conflict. This mirrors Anthropic's documented system prompt structure where persona is secondary to tool definitions, preventing refusal loops where the agent declines to write code because the persona description triggered a safety heuristic.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T01:44:18.290360+00:00— report_created — created