Report #9016
[research] Retrieved context is misattributed; the model generates a correct fact but cites a source that does not actually support it
Decouple claim generation from citation attribution. Generate the claims first, then use an NLI (Natural Language Inference) classifier to check that each claim is entailed by the specific source before appending its citation.
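A minimal sketch of the verification step, assuming claims have already been generated and each one arrives paired with a candidate source id. The names `attach_verified_citations`, `EntailmentScorer`, and `toy_scorer` are hypothetical and not part of the original report. `toy_scorer` is a token-overlap stand-in so the example runs without a model; it is not an NLI classifier, and a real one would be plugged in behind the same `(premise, hypothesis) -> score` interface. The 0.8 threshold is an arbitrary illustration.

```python
from typing import Callable, Dict, List, Tuple

# Assumed interface: returns an entailment score in [0, 1] for
# (premise, hypothesis). A real NLI classifier would sit behind this.
EntailmentScorer = Callable[[str, str], float]


def attach_verified_citations(
    candidates: List[Tuple[str, str]],   # (claim, candidate source id)
    sources: Dict[str, str],             # source id -> passage text
    scorer: EntailmentScorer,
    threshold: float = 0.8,              # illustrative value, tune per model
) -> List[str]:
    """Append a citation only when the cited passage entails the claim."""
    rendered = []
    for claim, source_id in candidates:
        passage = sources.get(source_id, "")
        # Unknown or empty sources can never support a claim.
        score = scorer(passage, claim) if passage else 0.0
        if score >= threshold:
            rendered.append(f"{claim} [{source_id}]")
        else:
            # Keep the claim but drop the unsupported citation.
            rendered.append(claim)
    return rendered


def toy_scorer(premise: str, hypothesis: str) -> float:
    """Stand-in only: share of hypothesis tokens found in the premise."""
    premise_tokens = set(premise.lower().replace(".", "").split())
    hypothesis_tokens = hypothesis.lower().replace(".", "").split()
    if not hypothesis_tokens:
        return 0.0
    hits = sum(token in premise_tokens for token in hypothesis_tokens)
    return hits / len(hypothesis_tokens)


if __name__ == "__main__":
    sources = {
        "doc1": "The Eiffel Tower is in Paris.",
        "doc2": "Mount Everest is the highest mountain.",
    }
    candidates = [
        ("The Eiffel Tower is in Paris.", "doc1"),  # supported
        ("The Eiffel Tower is in Paris.", "doc2"),  # misattributed
    ]
    for line in attach_verified_citations(candidates, sources, toy_scorer):
        print(line)
```

The generator never decides whether a citation stays; only the separate scorer does. Dropping the citation is one possible policy for a failed check; flagging the claim for review or retrying against other retrieved passages would fit the same structure.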
Journey Context:
Standard RAG prompts ('Cite your sources') often lead to 'citation bleeding', where the model attaches the most relevant-looking citation to a generated claim even though the cited passage does not strictly entail it. Evaluating faithfulness requires an independent verification step, as the generation model cannot reliably judge its own attribution accuracy.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T07:08:36.024212+00:00— report_created — created