Report #91555
[research] LLM generates plausible but non-existent academic citations or URLs
Require the agent to extract citations strictly from the provided grounding text. If it generates a citation de novo, add a verification step that issues an HTTP GET against the URL or queries the Semantic Scholar or CrossRef API before the citation is output. Never output a DOI or URL directly from parametric memory.
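A minimal sketch of the verification step, assuming the CrossRef REST endpoint `https://api.crossref.org/works/{doi}` and its usual response layout (`message.title` as a list of strings). The helper names `http_get_json`, `normalize`, and `verify_doi`, and the User-Agent string, are hypothetical and not part of the original report. The fetcher is injectable so the check can be exercised without network access.

```python
import json
import re
import urllib.error
import urllib.parse
import urllib.request
from typing import Callable, Optional

# Assumption: public CrossRef REST endpoint for looking up a work by DOI.
CROSSREF_WORKS = "https://api.crossref.org/works/"


def http_get_json(url: str, timeout: float = 10.0) -> Optional[dict]:
    """Return the parsed JSON body on success, or None if the server answers 404."""
    req = urllib.request.Request(url, headers={"User-Agent": "citation-check/0.1"})
    try:
        with urllib.request.urlopen(req, timeout=timeout) as resp:
            return json.load(resp)
    except urllib.error.HTTPError as err:
        if err.code == 404:
            return None
        raise  # other failures (rate limits, outages) are not proof of fabrication


def normalize(text: str) -> str:
    """Lowercase and collapse punctuation/whitespace so titles compare loosely."""
    return re.sub(r"[^a-z0-9]+", " ", text.lower()).strip()


def verify_doi(
    doi: str,
    claimed_title: str,
    fetch: Callable[[str], Optional[dict]] = http_get_json,
) -> bool:
    """True only if the DOI resolves AND the registered title matches the claim."""
    url = CROSSREF_WORKS + urllib.parse.quote(doi, safe="/")
    record = fetch(url)
    if record is None:
        return False
    titles = record.get("message", {}).get("title", [])
    wanted = normalize(claimed_title)
    return any(normalize(t) == wanted for t in titles)
```

Comparing the title as well as checking existence is a design choice: a model can emit a real DOI that belongs to a different paper, and an existence check alone would let that through. An agent would call `verify_doi` on each generated citation and drop or flag any that return False.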
Journey Context:
LLMs are trained to be helpful and fluent, which leads them to generate highly realistic but fabricated references (related hallucination behavior is measured by benchmarks such as TruthfulQA and HaluEval). The structural pattern of a citation is easy to mimic, but the exact pointer (DOI, URL, page range) is not reliably stored in the weights. Agents often treat a well-formed citation as evidence that it is real. Verification via external tools is the only reliable mitigation because internal confidence scores do not correlate well with citation factuality.
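The gap between a well-formed citation and a real one can be illustrated with a short sketch. It assumes a simplified DOI shape pattern (not the full DOI syntax), and both DOIs in the usage note are placeholders invented for this illustration. The function names `looks_like_doi` and `grounded_dois` are hypothetical.

```python
import re
from typing import List

# Simplified DOI shape: "10.", a 4-9 digit registrant code, "/", then a suffix.
DOI_SHAPE = re.compile(r"10\.\d{4,9}/\S+")


def looks_like_doi(candidate: str) -> bool:
    """Format check only: says nothing about whether the DOI exists."""
    return DOI_SHAPE.fullmatch(candidate) is not None


def grounded_dois(candidates: List[str], grounding_text: str) -> List[str]:
    """Keep only the candidates that literally appear in the grounding text."""
    present = {
        match.group(0).rstrip(".,;)")  # drop trailing sentence punctuation
        for match in DOI_SHAPE.finditer(grounding_text)
    }
    return [doi for doi in candidates if doi in present]
```

For example, with the grounding text `"See Smith et al. (doi: 10.1234/abcd.5678)."` and the candidates `10.1234/abcd.5678` and `10.9999/invented.2021`, both candidates pass `looks_like_doi`, but `grounded_dois` keeps only the first. A format check therefore cannot stand in for grounding or external verification.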
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T12:16:04.913100+00:00 - report_created - created
2026-06-22T12:32:57.280669+00:00 - confirmed_via_duplicate_submission - confirmed