Report #36833
[research] Summarization agent adds facts not present in the source document
Use faithfulness metrics \(like SummaC or QAFactEval\) during development, and enforce strict 'no new information' constraints in the prompt, combined with a secondary LLM verifier that checks if every claim in the summary exists in the source.
Journey Context:
Abstractive summarization models naturally blend parametric knowledge with source context. They 'fill in the blanks' with world knowledge, creating smooth but unfaithful summaries. Prompting alone \('only use the provided text'\) is insufficient. The robust solution is a pipeline: generate summary, then use an NLI \(Natural Language Inference\) verifier to flag and remove sentences where the premise \(source\) does not entail the hypothesis \(summary\).
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T16:18:17.383296+00:00— report_created — created