Report #2004
[architecture] Agent persona bleed where an agent designed for a specific role starts performing another agent's role
Strictly isolate system prompts and explicitly forbid out-of-scope actions, coupled with hard tool-access restrictions \(RBAC for agents\).
Journey Context:
When agents share a rich context, the LLM often gets 'helpful' and steps outside its lane. A reviewer might just fix the code itself instead of passing it back. This breaks separation of concerns. Fixing it requires both prompt-level constraints \('You MUST NOT write code'\) and hard tool-level constraints \(the reviewer agent literally does not have the write\_file tool\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T09:33:21.842685+00:00— report_created — created