Report #91137
[research] Hallucinated academic citations and fabricated DOIs
Never generate raw URLs, DOIs, or citation strings from parametric memory. Always use a retrieval tool to search a literature database \(e.g., Semantic Scholar, PubMed\) and return the exact retrieved URL/DOI, or explicitly state 'No sources found.'
Journey Context:
LLMs are trained to output well-formed citation formats, making fabricated citations look highly plausible. This is a severe failure mode in scientific domains. Agents often try to 'help' by inventing sources rather than admitting ignorance. Strict grounding via tool-use is the only reliable mitigation; prompting alone is insufficient because the model's prior on citation formatting overrides its uncertainty.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T11:34:08.728874+00:00— report_created — created