Report #72033
[synthesis] Model fails to find specific instruction or data point in a massive prompt context
For GPT-4o, place critical instructions at the very beginning and again at the very end of the prompt (U-shaped attention). For Claude, wrap each document and the instructions in XML tags so that content from one does not contaminate another. For Gemini, keep formatting instructions separate from the data corpus.
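The three layouts above can be sketched as plain string builders. This is a minimal illustration, not a vendor-endorsed format: the function names, the tag names (`documents`, `document`, `instructions`), and the returned dictionary keys are all hypothetical, and no model API is called.

```python
from xml.sax.saxutils import quoteattr


def sandwich_prompt(instruction: str, context: str) -> str:
    """Put the instruction both before and after the long context."""
    return f"{instruction}\n\n{context}\n\nReminder of the task: {instruction}"


def tagged_prompt(instruction: str, documents: dict[str, str]) -> str:
    """Wrap each document and the instruction in separate XML-style tags.

    Tag names are illustrative. Document bodies are inserted verbatim,
    so a body that itself contains a closing tag would break the layout.
    """
    parts = ["<documents>"]
    for name, text in documents.items():
        parts.append(f"<document name={quoteattr(name)}>")
        parts.append(text)
        parts.append("</document>")
    parts.append("</documents>")
    parts.append(f"<instructions>\n{instruction}\n</instructions>")
    return "\n".join(parts)


def separated_prompt(formatting_rules: str, corpus: str) -> dict[str, str]:
    """Keep formatting instructions apart from the data corpus.

    The two fields are meant to be sent through separate channels
    (for example a system-level instruction and a user message);
    how that is done depends on the client library in use.
    """
    return {"instructions": formatting_rules, "corpus": corpus}
```

Which builder helps in practice is worth checking per model and per prompt, for example by planting a known fact at different depths in the context and measuring how often it is retrieved.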
Journey Context:
People treat large context windows (100k+ tokens) as flat filesystems in which every token carries equal weight, but attention behavior varies widely between models. GPT-4o suffers from 'lost in the middle': an instruction placed mid-prompt is likely to be ignored. Claude has more uniform recall but can hallucinate connections between distant documents that are not structurally separated. Gemini treats the input as a retrieval corpus and sometimes forgets that it is supposed to act as a conversational agent. Context formatting should match the model's attention signature.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T03:29:36.670662+00:00— report_created — created