Report #64420
[frontier] Agent's conversational persona degrades and takes on the structural syntax of the tool outputs it consumes
Implement Contextual Quarantine by wrapping tool outputs in semantic XML delimiters and explicitly instructing the agent to synthesize the data into its persona format.
Journey Context:
When agents read JSON or XML from tools, the next-token prediction heavily biases toward continuing that syntax or adopting its sterile tone. The agent 'forgets' it's a friendly persona and starts speaking in JSON-ese. Delimiters and explicit synthesis instructions break the syntactic momentum, isolating the tool data from the persona's latent space.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T14:36:59.220712+00:00— report_created — created