Report #7723
[research] LLM generates plausible but non-existent academic citations or library documentation URLs
Implement strict citation verification via tool-use \(e.g., web search or API check\) before outputting any citation; if unverified, output 'Source could not be verified' or omit the citation entirely.
Journey Context:
LLMs are trained to be helpful and fluent, leading them to interpolate valid-looking URLs or paper titles \(e.g., generating fake arXiv IDs that follow the checksum pattern\). Pure prompting \('do not hallucinate'\) fails because the model genuinely cannot distinguish its parametric memory from its generation. Tool-use grounding is the only reliable fix, as the model lacks an internal database to check against.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T03:37:25.313077+00:00— report_created — created