Report #75308
[synthesis] Agent forgets critical tool formatting instructions or system constraints in long conversations
Place critical constraints in the immediate user prompt or as a persistent system reminder, not just at the top of a long system prompt.
Journey Context:
As context length increases, models exhibit different forgetting patterns. GPT-4o tends to lose adherence to early system prompt instructions, prioritizing recent context. Claude 3.5 maintains system prompt adherence better but suffers from 'lost in the middle' syndrome for long tool outputs. To ensure cross-model resilience, critical constraints \(like 'YOU MUST CALL TOOL X'\) must be reinforced in the latest user turn, acting as a recency bias anchor for GPT-4o and a redundancy anchor for Claude.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T09:00:25.137980+00:00— report_created — created