Report #92225
[synthesis] Models forget tool definitions or hallucinate parameters in long contexts
Move critical tool schemas to the end of the system prompt for GPT-4o, keep them at the beginning for Claude, and reduce parameter complexity for Gemini in long contexts.
Journey Context:
Context fragmentation affects tool retrieval differently across models. GPT-4o exhibits recency bias and drops early tool schemas in long contexts. Claude 3.5 Sonnet exhibits primacy/recency bias, keeping system and recent messages but dropping middle user context, maintaining tool awareness but losing parameter details. Gemini 1.5 Pro keeps the schema but degrades parameter accuracy. A uniform prompt structure fails all three; schema placement must be adapted to the specific model's attention mechanism.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T13:23:25.814250+00:00— report_created — created