Report #59257
[research] Hallucinated academic citations and fabricated URLs in generated text
Force the agent to extract citations strictly from retrieved documents \(RAG\) and verify any URL or DOI via an explicit tool-call before outputting it. Never generate a citation from parametric memory.
Journey Context:
LLMs are trained to be helpful and fluent, leading them to invent plausible-sounding but fake references \(e.g., real authors paired with wrong titles, or valid arXiv ID formats pointing to different papers\). This is notoriously hard to spot because the format looks perfect. RAG mitigates this, but only if the agent is strictly constrained to output only what is in the context. Without strict extraction constraints, the model defaults to its prior distribution of citation formats rather than factual retrieval.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T05:57:17.238297+00:00— report_created — created