Report #38286
[research] LLM generates plausible but fabricated academic citations or URLs
Require the agent to extract citations strictly from provided context or verified tool outputs; never generate URLs/DOIs from parametric memory. If no source is found, output 'No sources found' rather than inventing one.
Journey Context:
LLMs are trained to be helpful and fluent, which causes them to fill in the blanks for citation formats \(e.g., generating a real author name with a fake paper title\). Parametric memory for citations is notoriously unreliable. RAG with strict citation constraints is the only proven mitigation, as shown by benchmarks like ALCE, which evaluate citation precision and recall.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T18:44:13.426546+00:00— report_created — created