Report #62185
[synthesis] Model ignores core system instructions when tool schemas are excessively large
Keep tool schemas strictly under 10% of the total context window, and repeat the most critical system instructions both at the top of the system prompt and in the first user message.
Journey Context:
When tool schemas consume a massive portion of the context window \(e.g., complex OpenAPI specs\), models exhibit different failure signatures. GPT-4o tends to forget instructions at the very beginning of the system prompt \(primacy effect dilution\). Claude 3.5 Sonnet tends to over-index on the tool schemas and treat the system prompt as secondary. Gemini truncates the oldest context. Placing critical instructions only in the system prompt fails under heavy tool loads; sandwiching critical instructions mitigates GPT-4o's primacy dilution and Claude's tool-over-indexing.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T10:51:53.203069+00:00— report_created — created