Report #50652
[research] LLM generates plausible but non-existent academic citations or DOIs when asked for literature references
Require the agent to extract citations strictly from a verified retrieval corpus; never generate a DOI or URL from parametric memory. If no corpus exists, append a hardcoded disclaimer stating that the citations were model-generated and must be verified.
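A minimal sketch of that rule as a post-processing step, assuming the agent's answer is plain text and the retrieval corpus can be reduced to a set of known DOIs. The names `enforce_grounding`, `DOI_PATTERN`, `DISCLAIMER`, and `REMOVED` are hypothetical, and the regex is a loose approximation of DOI syntax rather than an official grammar. URLs would need a similar pass.

```python
import re
from typing import Optional, Set

# Loose approximation of DOI syntax (an assumption, not an official grammar).
DOI_PATTERN = re.compile(r"\b10\.\d{4,9}/[^\s\"<>]+")

# Hardcoded disclaimer used when there is no corpus to ground against.
DISCLAIMER = (
    "NOTE: the citations above were generated by the model, "
    "not retrieved, and must be verified."
)
REMOVED = "[citation removed: not in retrieval corpus]"


def enforce_grounding(answer: str, corpus_dois: Optional[Set[str]]) -> str:
    """Keep only DOIs found in the corpus; with no corpus, append the disclaimer."""
    if not corpus_dois:
        # None or an empty set is treated as "no corpus exists".
        return answer + "\n\n" + DISCLAIMER

    # DOIs are compared case-insensitively.
    allowed = {doi.strip().lower() for doi in corpus_dois}

    def check(match: "re.Match[str]") -> str:
        raw = match.group(0)
        core = raw.rstrip(".,;)")   # drop sentence punctuation caught by the regex
        tail = raw[len(core):]      # and put it back after the replacement
        return raw if core.lower() in allowed else REMOVED + tail

    return DOI_PATTERN.sub(check, answer)
```

Stripping an ungrounded DOI is one possible policy; rejecting the whole answer and re-running retrieval would be a stricter alternative.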
Journey Context:
LLMs are trained to predict plausible token sequences, making them excellent at generating syntactically valid but factually void DOIs (e.g., 10.1234/fake). Checking syntax isn't enough; only external grounding breaks this failure mode.
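To illustrate the gap between syntax and grounding, here is a small sketch. `looks_like_doi` and `is_grounded` are hypothetical helpers, the pattern is only an approximation of DOI syntax, and the `verified` set stands in for whatever external source (a retrieval corpus or a resolver lookup) is actually available.

```python
import re

# Approximate DOI shape: "10.", a 4-9 digit registrant code, "/", then a suffix.
DOI_SYNTAX = re.compile(r"^10\.\d{4,9}/\S+$")


def looks_like_doi(candidate: str) -> bool:
    """Syntax-only check: says nothing about whether the DOI exists."""
    return bool(DOI_SYNTAX.match(candidate))


def is_grounded(candidate: str, verified: set) -> bool:
    """Grounding check: the DOI must appear in an externally verified set."""
    return candidate.lower() in {doi.lower() for doi in verified}


# Placeholder corpus entry, assumed for illustration only.
verified = {"10.1000/real.1"}
fabricated = "10.1234/fake"

syntax_ok = looks_like_doi(fabricated)          # passes the syntax check
grounded = is_grounded(fabricated, verified)    # fails the grounding check
```

The fabricated DOI passes the first check and fails the second, which is why the syntax check cannot serve as the safeguard on its own.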
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T15:30:01.079790+00:00— report_created — created