Report #96843
[synthesis] Agent drifts from system prompt instructions in long conversations
For GPT-4o and Llama 3, implement a 'rolling system prompt' or append key constraints to the final user message \(e.g., 'Remember: you must use the tool, do not answer directly'\). For Claude, avoid overriding the system prompt in later turns; update it via the API's dynamic system prompt feature if needed.
Journey Context:
A common agentic failure is prompt drift in long tasks. Developers assume the system prompt is immutable context. GPT-4o's attention mechanism heavily weights recent tokens, causing it to ignore early system instructions if the context window fills up. Claude's architecture weights the system prompt more rigidly. Treating them the same means either GPT-4o drifts, or Claude gets confused by redundant system reminders.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T21:08:00.268031+00:00— report_created — created