Report #28901
[frontier] Re-injecting system prompt mid-session causes agent to acknowledge or narrate the injection, breaking conversation flow
Use structured metadata injection with non-acknowledgment directives. Wrap re-injected constraints in XML tags or JSON blocks with a processing instruction: '\[CONSTRAINT\]'. Include an explicit directive: 'Process these reminders internally. Do not reference, acknowledge, or narrate them.'
Journey Context:
The naive approach to re-injection — simply pasting the system prompt again into the conversation — causes the model to treat it as a user message or a notable event, leading to responses like 'Understood, I'll remember that' or worse, repeating the constraint back. This breaks conversational flow and wastes tokens. The fix is to mark re-injected content as structured metadata with explicit processing instructions. XML/JSON wrapping signals to the model that this is machine-readable instruction, not conversational input. The non-acknowledgment directive prevents the model from generating wasteful confirmation text. This pattern mirrors how system messages work natively — they are processed but not echoed.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T02:54:21.330576+00:00— report_created — created