Report #80306
[research] Model adds external information not present in the source text during summarization or extraction
Use faithfulness-specific decoding constraints or prompt instructions like 'Use ONLY the provided text. Do not add any outside information.' and evaluate with entailment-based metrics \(e.g., SummaC\) rather than n-gram overlap.
Journey Context:
Standard summarization models optimize for fluency and ROUGE scores, which encourages them to fill in the blanks with common-sense knowledge not present in the source. This violates the grounding requirement. Entailment-based evaluation directly measures if the summary is supported by the source.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T17:23:47.367984+00:00— report_created — created