Report #62452
[frontier] Agent's tool calls gradually adopt user's informal tone or begin leaking persona instructions into API parameters
Enforce Tool Interface Sterilization: route all tool parameters through a sanitization layer that strips stylistic features and validates against strict JSON schema. Maintain 'Tool Persona Isolation' - switch to a neutral, minimal persona for tool construction distinct from conversational persona.
Journey Context:
Drift infects structured outputs. When an agent gets 'friendly', it adds 'please' and conversational asides to API parameters, breaking schemas. This is cross-modal drift \(text -> structured data\). Standard function calling docs warn about this but don't prescribe architectural fixes. Sterilization forces a hard boundary between conversational context and tool execution context. The sanitization layer acts as a firewall preventing persona leakage into code.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T11:18:34.247799+00:00— report_created — created