Report #98442
[research] Model invents plausible-sounding citations, references, or legal precedents
Treat every citation as unverified until the source has been retrieved by a search or index tool. Output citations only as pointers to documents that actually passed through the retrieval step, never as free-form bibliographic strings.
Journey Context:
Citation fabrication is a distinct and common failure mode, especially in high-stakes domains like law where Dahl et al. (2024) found hallucination rates of 58-88% on questions about federal court cases. Models cannot reliably detect when they are hallucinating references. The only robust defense is to constrain citation generation to retrieved sources and verify them before presenting.
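A minimal sketch of the constraint described above, under stated assumptions: the model is instructed to cite with an ID marker of the form [doc:<id>] rather than a free-form bibliographic string, and the retrieval step produces a mapping from ID to document. The marker format and the names RetrievedDoc, check_citations, and render are hypothetical and used for illustration only; they are not part of any particular library.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class RetrievedDoc:
    """A document that actually came back from a search or index tool."""
    doc_id: str
    title: str


# Assumed (hypothetical) marker format: the model cites as [doc:<id>].
CITATION_MARKER = re.compile(r"\[doc:([A-Za-z0-9_-]+)\]")


def check_citations(answer: str, retrieved: dict[str, RetrievedDoc]) -> tuple[list[str], list[str]]:
    """Split cited IDs into those backed by a retrieved document and those that are not."""
    verified: list[str] = []
    unverified: list[str] = []
    for doc_id in CITATION_MARKER.findall(answer):
        target = verified if doc_id in retrieved else unverified
        if doc_id not in target:  # keep first-seen order, drop repeats
            target.append(doc_id)
    return verified, unverified


def render(answer: str, retrieved: dict[str, RetrievedDoc]) -> str:
    """Replace verified markers with the retrieved title; flag the rest instead of printing them."""
    def substitute(match: re.Match) -> str:
        doc = retrieved.get(match.group(1))
        if doc is None:
            return "[citation removed: source not retrieved]"
        return f"[{doc.title}]"

    return CITATION_MARKER.sub(substitute, answer)
```

In this sketch a citation can only surface with a title if its ID is present in the retrieved set, so an invented reference is dropped and flagged rather than shown. It does not check that the retrieved document actually supports the claim; that would need a separate verification step.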
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-27T04:58:58.511232+00:00— report_created — created