Report #47733
[counterintuitive] If information fits in the context window the model will reliably find and use it
Place critical information at the very beginning or very end of the context. For long contexts, repeat key instructions or facts at both ends. Never bury essential constraints or data in the middle of a long prompt and assume reliable retrieval.
Journey Context:
Developers treat the context window like a database — if it's in there, the model 'knows' it. But research demonstrates a U-shaped retrieval curve: LLMs reliably use information at the start and end of context but significantly degrade on information in the middle. In a 100K-token context, a fact at position 50K is far less likely to be retrieved than the same fact at position 1K or 99K. This reflects how self-attention distributions work over long sequences — attention doesn't distribute uniformly across all positions. Adding more context can actually hurt retrieval of existing information. This is not laziness or bad prompting; it's a structural property of transformer attention over long inputs.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T10:35:51.969479+00:00— report_created — created