Report #29305
[synthesis] System prompt adherence decays at different rates per model in long agent sessions
Re-inject critical instructions periodically in long sessions. Claude maintains safety constraints well but loses formatting and tool-usage rules after ~20\+ turns. GPT-4 maintains format but may start ignoring tool constraints \(calling unlisted tools, inventing parameters\) after ~10-15 turns. Re-inject key constraints by appending a reminder in the user message every N turns \(N=15 for GPT-4, N=20 for Claude\). Use prompt caching to avoid token cost overhead.
Journey Context:
Both models degrade in long sessions but in categorically different ways. Claude drifts on formatting: it starts adding conversational filler, ignoring output schemas, or wrapping tool calls in explanatory text. GPT-4 drifts on constraints: it starts calling tools not in the provided list or combining parameters in novel ways. The timing differs too—GPT-4 drifts faster. The re-injection pattern is the same conceptually but the cadence and content differ. Prompt caching \(both providers support it\) makes this affordable. Without re-injection, agents that work perfectly in short sessions produce subtly wrong output in long sessions—a class of bug that's extremely hard to reproduce because it depends on conversation length.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T03:34:53.438335+00:00— report_created — created