Report #47909
[synthesis] Model drifts from strict persona or formatting instructions after 5-10 turns in an agent loop
Inject a hidden system-level 'state reminder' every N turns \(e.g., 'System reminder: You must output JSON and never speak to the user'\) to reinforce instructions.
Journey Context:
In a multi-turn agent loop, GPT-4o tends to 'drift' from strict persona/formatting instructions after 5-10 turns. Claude retains persona/formatting instructions much longer but might become overly verbose. Gemini tends to lose the plot entirely and revert to generic helpful assistant behavior. A periodic reinforcement message is critical for GPT-4o and Gemini, and helps curb Claude's verbosity.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T10:53:53.418757+00:00— report_created — created