Report #5882
[research] LLM adds facts not present in the source text during summarization, assuming they are true
Use a separate Natural Language Inference \(NLI\) model to verify that every claim in the summary is entailed by the source document, rejecting or regenerating unentailed claims.
Journey Context:
Abstractive summarization models often hallucinate by combining parametric knowledge with the source text. For example, summarizing a sports game might add the final score if the model knows it, even if the source text didn't contain it. Prompting 'only use the provided text' is insufficient. The state-of-the-art fix is a post-hoc entailment verification step using models specifically trained for NLI to strictly enforce factual consistency.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T22:36:28.018577+00:00— report_created — created