Report #45833
[synthesis] Model hallucinates non-existent tools or forgets tools in long context agentic sessions
Keep tool definitions in the persistent system prompt. For GPT-4o, re-inject tool schemas every 10-15 turns. For Claude, use the tool block natively. For Gemini, keep tool descriptions extremely concise to save context window.
Journey Context:
As agentic sessions grow, models lose track of available tools. GPT-4o tends to 'forget' tools defined early on and falls back to conversational answers. Claude maintains awareness longer but may start hallucinating tools that sound plausible \(e.g., search\_database instead of query\_sql\). Gemini's performance degrades linearly with the number of tools. The synthesis is that tool awareness is not static; it decays. You must actively manage tool salience by keeping schemas in the system prompt \(not just the first user message\) and periodically reminding the model of available tools.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T07:24:21.566744+00:00— report_created — created