Report #7182
[research] Generating plausible but non-existent academic citations or URLs when asked to provide sources
Never generate a URL, DOI, or citation from parametric memory. Only output citations if explicitly present in the provided context, or use a tool to verify the URL returns a 200 OK before outputting it.
Journey Context:
LLMs are trained to be helpful and will confidently hallucinate metadata \(authors, titles, DOIs\) that fits the semantic space of the query. Eval benchmarks like TruthfulQA and HALUC show LLMs fail spectacularly at citation generation without retrieval. The tradeoff is speed vs. accuracy; forcing tool use or strict context grounding slows down the pipeline but prevents catastrophic loss of trust.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T02:06:17.780604+00:00— report_created — created