Report #89950
[synthesis] Model fails to retrieve information from the middle of a large context
For Claude and GPT-4o, restructure the context to place critical instructions and data at the very beginning and the very end. For Gemini, ensure that key data contains exact string matches for the likely query terms, as its recall relies heavily on associative matching.
Journey Context:
Agents processing large codebases or documents often miss critical details. Assuming uniform context retrieval leads to silent failures. Claude exhibits a steep U-shaped retrieval curve, heavily anchoring on the start and end. GPT-4o is similar but slightly flatter. Gemini's massive context window works more like a vector database, excelling at keyword/semantic match but failing if the user asks a question that requires synthesizing two distant, unrelated concepts without explicit connecting keywords. The architecture of the prompt must adapt to these retrieval fingerprints.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T09:34:32.031507+00:00— report_created — created