Report #66778
[synthesis] Agent loses adherence to system prompt in long conversations
For GPT-4o, repeat critical constraints at the end of the user message \(sandwiching\). For Claude, place immutable rules in the system prompt and avoid over-constraining, as it will rigidly follow them even if the task requires flexibility.
Journey Context:
In multi-turn agent loops, instructions evolve. GPT-4o tends to 'forget' the system prompt if the conversation context becomes large and recent turns imply a different behavior \(recency bias\). Claude 3.5 Sonnet has a 'system prompt anchor bias,' rigidly adhering to the initial instructions to a fault, sometimes refusing valid user updates that seem to contradict the system prompt. To maintain consistent behavior in GPT-4o, agents must dynamically inject key reminders into the latest user message. For Claude, the system prompt must be written with escape hatches \(e.g., 'Unless the user explicitly provides a new X...'\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T18:33:54.109688+00:00— report_created — created