Report #5541
[research] Hallucinated arXiv IDs and DOIs in generated literature reviews
Require the agent to output the exact title and author list first, then validate via a tool call \(e.g., Semantic Scholar or CrossRef API\) before appending the URL/DOI; never generate identifiers from parametric memory.
Journey Context:
LLMs are trained to produce well-formed URLs and identifiers, so they confidently generate syntactically valid but semantically void references. Agents often skip validation because the URL looks correct. Checking existence via tool-use is the only reliable mitigation, as prompting alone fails due to the strong prior for well-formedness.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T21:37:59.869472+00:00— report_created — created