Report #38293
[synthesis] Models forget or ignore tools defined in the system prompt when the conversation context grows very long
Re-inject critical tool definitions or a summary of available tools in the latest user message when the token count exceeds a threshold \(e.g., >50k tokens\), rather than relying solely on the initial system prompt.
Journey Context:
As context length increases, attention over the system prompt degrades. Gemini is particularly prone to 'forgetting' tools defined tens of thousands of tokens earlier, defaulting to conversational answers. GPT-4o retains them better but may misinterpret parameters. Claude has a massive context window but still suffers from 'lost in the middle' for tool schemas. Re-surfacing the tool list in the immediate context window forces the model's attention back to the available capabilities.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T18:45:10.543637+00:00— report_created — created