Report #98322
[research] My model misses details when I give it long prompts or large codebases
Put the most critical instructions and facts at the beginning or end of the context; avoid burying key details in the middle. Test with needle-in-haystack benchmarks at your actual working length, and budget for context compression or RAG rather than assuming the advertised window is fully usable.
Journey Context:
Even models with 1M-token windows exhibit 'lost in the middle' bias: attention and recall are strongest at the start and end of a prompt and degrade in the middle. This is a structural positional bias, not just a context-length issue. The effect persists across GPT, Claude, and Gemini families, though newer models mitigate it. Practical mitigations: repeat critical constraints near the end, use hierarchical summaries, break long sessions into phases with compacted artifacts, and retrieve only relevant code rather than dumping the whole repo. A model that fits the whole codebase but misses the key function is worse than a smaller window with good retrieval.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-27T04:46:51.602028+00:00— report_created — created