Report #43885
[synthesis] Models hallucinating or failing to retrieve information buried in the middle of large contexts
For GPT-4o, place critical instructions at both the very beginning and the very end of the prompt. For Claude, repeat the core instruction at the end. For Gemini, rely on its native long-context retrieval, but verify that any quoted text matches the source exactly.
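A minimal sketch of the placement advice above, assuming the prompt is assembled as a single string. The function name `build_prompt` and the section labels (`INSTRUCTION`, `CONTEXT`, `REMINDER`) are hypothetical, chosen for illustration only; they are not part of any vendor API.

```python
def build_prompt(instruction: str, context: str, repeat_at_end: bool = True) -> str:
    """Put the core instruction before the long context and, optionally, again after it.

    Hypothetical helper: the labels below are illustrative, not a required format.
    """
    core = instruction.strip()
    parts = [
        f"INSTRUCTION:\n{core}",          # instruction at the very beginning
        f"CONTEXT:\n{context.strip()}",   # long material sits in the middle
    ]
    if repeat_at_end:
        # Restate the same instruction verbatim so it is also the last thing read.
        parts.append(f"REMINDER (same instruction as above):\n{core}")
    return "\n\n".join(parts)
```

Calling `build_prompt("Answer only from the context.", long_document)` yields a prompt that opens and closes with the instruction. Passing `repeat_at_end=False` keeps only the leading copy, which may be useful for comparing the two layouts on your own data.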
Journey Context:
Retrieval degradation patterns differ by model. When retrieval fails, GPT-4o tends to fill the gap with plausible-sounding content, Claude tends to default to an 'I don't know' refusal, and Gemini retrieves the passage but may paraphrase it inaccurately. Placing crucial data at the start or end of the prompt helps with all three, but GPT-4o requires the strictest adherence to prevent hallucination.
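One way to catch inaccurate paraphrases or invented details is to check that any text the model presents as a quote actually appears in the source. The sketch below is an assumption about how such a check could look; `quote_is_verbatim` is a hypothetical helper, and ignoring case and whitespace differences is a design choice rather than a requirement.

```python
import re


def _normalize(text: str) -> str:
    """Collapse runs of whitespace and lowercase, so line wraps do not cause mismatches."""
    return re.sub(r"\s+", " ", text).strip().lower()


def quote_is_verbatim(quote: str, source: str) -> bool:
    """Return True if `quote` appears in `source`, ignoring case and whitespace layout.

    Hypothetical check: an empty quote is treated as not verified.
    """
    normalized_quote = _normalize(quote)
    return bool(normalized_quote) and normalized_quote in _normalize(source)
```

For example, with the source text `"The retention\nperiod is 90 days."`, the quote `"retention period is 90 days"` passes, while the paraphrase `"retention lasts 90 days"` does not. A failed check does not prove the answer is wrong, only that the wording is not taken directly from the context.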
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T04:08:02.870900+00:00— report_created — created