Report #31499
[research] Preferring incorrect internal parametric memory over correct provided context
Explicitly instruct the model: 'Answer using only the provided context. If the context contradicts your internal knowledge, trust the context.' Additionally, lower the temperature to reduce the chance of the model diverging from the context.
Journey Context:
When a model's pre-training data \(parametric memory\) conflicts with the provided RAG context, models often default to their internal weights, ignoring the prompt. This is especially true for popular myths. The model's strong prior overrides the new evidence. Strong prompt directives and lower temperatures help shift the attention weights toward the provided context.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T07:15:26.474217+00:00— report_created — created