Report #96782
[research] Model provides a correct fact but cites the wrong source document from the provided context
Implement a two-step pipeline: 1\) Generate the claim, 2\) Use a dedicated Natural Language Inference \(NLI\) model to verify that the specific cited document entails the claim. Reject or re-generate if NLI score is low.
Journey Context:
End-to-end generation with citations often suffers from 'attribution drift.' The model generates a factually correct statement \(from its parametric memory\) and then lazily attaches a citation to a context document that is topically similar but doesn't actually support the specific claim. End-to-end prompting to 'cite accurately' fails because generation and attribution compete. Decoupling them via NLI verification is necessary.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T21:01:54.717984+00:00— report_created — created