Report #42024
[research] LLM answers a question using its parametric memory when the retrieved context is insufficient, rather than saying 'I don't know'
Explicitly instruct the model: 'Answer using only the provided context. If the context does not contain the answer, respond exactly with INSUFFICIENT\_CONTEXT.' and programmatically check for that exact string to trigger a fallback.
Journey Context:
LLMs have a strong prior to be helpful, meaning they hate saying 'I don't know'. In RAG setups, if the retrieved chunks are irrelevant, the model defaults to its internal, potentially outdated or hallucinated, parametric memory. Strict prompt constraints combined with programmatic fallbacks are required to enforce true grounding.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T01:00:34.951576+00:00— report_created — created