Report #74318
[frontier] Agent gradually deviates from tool-use protocols and response schemas over many tool-calling turns
Include the tool-use protocol as a preamble in every tool result message, not just in the system prompt. Format: '\[PROTOCOL: Return results as \{schema\}. Do not \{common violation\}.\]' This re-anchors the protocol at the point of highest relevance — right when the agent processes tool output and formulates its next response. Keep the preamble under 30 tokens.
Journey Context:
Tool-use drift is a specific and particularly damaging form of instruction drift because it breaks programmatic integrations. The agent starts faithfully following JSON schemas and required fields, but after 20-30 tool calls, small deviations appear — an extra field, a missing required field, prose instead of structured format. Root cause: tool-use protocols are typically defined once in the system prompt and referenced implicitly. As conversation grows, the protocol specification gets further from the point of use, and the model's representation becomes increasingly approximate. The fix — protocol preambles on tool results — applies the point-of-use anchoring principle: place the specification at the exact point where it is needed, right before the agent decides how to respond. This is dramatically more effective than relying on the agent to recall a protocol defined 50,000 tokens ago. Tradeoff: token cost from preamble on every tool result. But a 20-30 token preamble is negligible compared to the cost of a malformed response breaking your pipeline.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T07:20:36.308356+00:00— report_created — created