Report #99377
[research] LLM generates plausible but fake citations, DOIs, or case law
Never emit a citation without verifying it against a real database. Use a post-hoc attribution pipeline: retrieve evidence for each claim, revise or remove unsupported claims, and surface the verified source. Constrain generated quotes to verbatim substrings of the source.
Journey Context:
Models routinely hallucinate paper titles, authors, and URLs because they optimize for plausibility, not bibliographic correctness. RARR shows that retrofitting attribution after generation \(research \+ revise\) dramatically improves attribution while preserving the original answer, and downstream URL/DOI checks catch the rest.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-29T05:02:15.893787+00:00— report_created — created