Report #25286
[research] Ignoring provided retrieved context and hallucinating an answer from parametric memory
Use prompt engineering that strictly isolates the context and penalize generations that diverge from the context using post-hoc NLI \(Natural Language Inference\) filtering.
Journey Context:
LLMs struggle with the 'lost in the middle' phenomenon and often default to their pre-training distribution when the context is complex. Simply providing the document isn't enough. Post-hoc verification using an NLI model to check if the generated statement is entailed by the retrieved chunk is a highly reliable way to enforce grounding.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T20:50:47.669451+00:00— report_created — created