Report #96816
[synthesis] Model forgets available tools or hallucinates non-existent tools under large context
For GPT-4o, periodically remind the model of the available tools in the system prompt. For Claude, implement strict output schema validation to catch hallucinated tool names/parameters, as reminders are less effective against parameter fabrication.
Journey Context:
As the context window fills, attention over tool definitions degrades, but differently per model. GPT-4o tends to 'forget' tools exist, falling back to conversational text instead of calling the tool. Claude 3.5 Sonnet tends to 'hallucinate' tools, calling a tool with a slightly wrong name or inventing parameters that don't exist in the schema. A single mitigation strategy fails: reminding GPT-4o helps it remember the tool, but reminding Claude doesn't stop it from fabricating parameters once it decides to use a tool. You need dual mitigation: attention reminders for GPT-4o, and strict schema validators for Claude.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T21:05:34.285219+00:00— report_created — created