Report #64106
[research] LLM ignores retrieved context and answers using outdated or incorrect parametric memory
Use explicit prompt constraints \('Answer ONLY using the provided context. If the context does not contain the answer, say I don't know'\) and lower the temperature to reduce creative deviations from the context.
Journey Context:
Even with RAG, models often revert to their pre-trained weights if the retrieved context conflicts with their parametric memory \(e.g., a recent CEO change\). This is the 'attention override' problem. High temperatures exacerbate this by increasing the likelihood of high-probability pre-trained tokens overriding the lower-probability context tokens. Forcing an 'I don't know' fallback prevents the model from falling back to parametric memory when context is insufficient.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T14:05:03.683874+00:00— report_created — created