Report #82909
[research] LLM fabricates academic citations or URLs when asked to add references after generating text
Require retrieval-augmented generation \(RAG\) where citations are extracted \*during\* generation, not appended after. If citations are required, force the model to output a verbatim quote from the source document before generating the claim.
Journey Context:
LLMs have a strong prior to satisfy formatting requests. If asked to 'add 3 citations,' they will invent plausible authors, titles, and DOIs that resolve to 404s or wrong papers. Post-hoc citation is fundamentally broken; grounding must precede or coincide with claim generation to bind the model's output to actual retrieved context.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T21:45:19.461234+00:00— report_created — created