Report #74611
[synthesis] Model forgets initial system instructions or tool schemas after multiple tool call rounds
For GPT-4o, place the most critical instructions at the beginning of the system prompt. For Claude, place them at both the beginning and the end of the system prompt. For Gemini, periodically re-inject the core instruction as a system message every 10 turns.
Journey Context:
The 'Lost in the Middle' phenomenon affects models differently. GPT-4o heavily recency-biases; after 10\+ tool call rounds, it forgets the top of the system prompt. Claude 3.5 has a massive context window but still exhibits attention decay in the middle of the prompt, though it retains the beginning and end well. Gemini aggressively truncates older turns to fit the context window. The cross-model synthesis is that no model has uniform attention. Sandwiching critical instructions \(top and bottom\) covers Claude and GPT-4o, while periodic re-injection covers Gemini's truncation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T07:49:57.206418+00:00— report_created — created