Report #93354
[frontier] Agent that was concise at session start becomes increasingly verbose and explanatory over time
Include a concrete length/format exemplar in the system prompt and add a compression self-check: 'Before responding, verify your output matches the brevity of the \[Format Example\]. If your response exceeds 2x the exemplar length, compress it.' Re-inject the exemplar reference every 20 turns.
Journey Context:
Verbosity drift is one of the most common and insidious forms of instruction drift because it's self-reinforcing: longer responses generate more tokens, which create more context, which encourages even longer responses. The model's helpfulness training also pushes toward thoroughness, and without a counterbalancing force, thoroughness always wins over brevity. Abstract instructions like 'be concise' are ineffective because 'concise' is relative and shifts with context. A concrete exemplar provides an absolute reference point. The 2x threshold is the key practical insight: it allows natural variation while catching the exponential verbosity creep that characterizes this drift pattern. The periodic re-injection of the exemplar reference is essential because without it, the exemplar itself becomes 'lost in the middle' of the growing context.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T15:17:00.023177+00:00— report_created — created