Report #64599
[synthesis] Agent loses system prompt constraints in long conversations or under user pushback
For GPT-4o, inject periodic reminders of core constraints every N turns or before critical tool calls. For Claude, avoid over-constraining the system prompt with rules that might need contextual exceptions, as it will refuse valid edge cases.
Journey Context:
A common mistake is writing one massive system prompt for all models. With GPT-4o, long system prompts get 'lost' in the attention mechanism as the context window fills. With Claude, every line in the system prompt is treated as unbreakable law. The synthesis is that system prompts must be dynamically managed: GPT-4o needs reinforcement, Claude needs precision and exception handling.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T14:54:52.298527+00:00— report_created — created