Report #79804
[research] LLM generates plausible but fabricated academic citations or URLs
Implement structural validation for citations (e.g., regex for DOI format, HTTP HEAD check for URLs) and enforce a strict 'no citation if not in context' policy via system prompts.
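A minimal sketch of the structural checks named above, using only the Python standard library. The function names, the arXiv pattern, the timeout, and the User-Agent string are illustrative assumptions, not part of the original report. The DOI pattern follows a commonly used form for modern DOIs and will reject some legacy ones. Passing these checks only shows that a citation is well formed or that a URL answers; it does not prove the cited work exists or says what the model claims.

```python
import http.client
import re
import urllib.request

# Commonly used pattern for modern DOIs (assumption: legacy DOIs are out of scope).
DOI_RE = re.compile(r"^10\.\d{4,9}/[-._;()/:A-Za-z0-9]+$")
# New-style arXiv identifiers: YYMM.NNNN or YYMM.NNNNN, optional version suffix.
ARXIV_RE = re.compile(r"^\d{4}\.\d{4,5}(v\d+)?$")


def is_well_formed_doi(doi: str) -> bool:
    """Format check only; a well-formed DOI can still be fabricated."""
    return DOI_RE.match(doi.strip()) is not None


def is_well_formed_arxiv_id(arxiv_id: str) -> bool:
    """Format check only; does not query arXiv."""
    return ARXIV_RE.match(arxiv_id.strip()) is not None


def url_resolves(url: str, timeout: float = 5.0) -> bool:
    """Send an HTTP HEAD request and report whether the final status is 2xx/3xx."""
    try:
        req = urllib.request.Request(
            url,
            method="HEAD",
            headers={"User-Agent": "citation-check/0.1"},  # hypothetical UA string
        )
        with urllib.request.urlopen(req, timeout=timeout) as resp:
            return 200 <= resp.status < 400
    except (OSError, ValueError, http.client.HTTPException):
        # Covers malformed URLs, DNS failures, timeouts, and 4xx/5xx responses.
        return False
```

Some servers reject HEAD requests, so a `False` from `url_resolves` is best treated as "needs manual review" rather than as proof of fabrication.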
Journey Context:
LLMs are trained to be helpful, so when asked for sources they tend to generate citations that are syntactically correct but point to nothing real (e.g., fake arXiv IDs). Post-generation filtering alone is insufficient because the model states the fabrication confidently, and a well-formed fake passes format checks. The fix requires both a hard constraint (only cite provided context) and an external verification step, as the model cannot reliably self-diagnose fabricated URLs.
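One way to enforce the "only cite provided context" constraint outside the model is to compare the identifiers in the answer against those in the supplied context. The sketch below is an assumption about how that could look, not the report's implementation: `extract_citations` and `ungrounded_citations` are hypothetical helpers, and the patterns cover only URLs, DOIs, and `arXiv:`-prefixed IDs.

```python
import re

# Matches URLs, bare DOIs, and "arXiv:"-prefixed identifiers (illustrative patterns).
CITATION_RE = re.compile(
    r"https?://[^\s<>\"')\]]+"
    r"|\b10\.\d{4,9}/[^\s<>\"')\]]+"
    r"|\barXiv:\d{4}\.\d{4,5}(?:v\d+)?",
    re.IGNORECASE,
)


def extract_citations(text: str) -> set[str]:
    """Collect citation-like identifiers, trimming trailing sentence punctuation."""
    return {m.group(0).rstrip(".,;") for m in CITATION_RE.finditer(text)}


def ungrounded_citations(answer: str, context: str) -> list[str]:
    """Return identifiers cited in the answer that never appear in the context."""
    allowed = {c.lower() for c in extract_citations(context)}
    return sorted(c for c in extract_citations(answer) if c.lower() not in allowed)


if __name__ == "__main__":
    context = "See https://example.org/paper and arXiv:1706.03762."
    answer = (
        "As shown in arXiv:1706.03762 and arXiv:2301.99999, "
        "see https://example.org/paper."
    )
    # Anything listed here was not supplied to the model and should be rejected.
    print(ungrounded_citations(answer, context))
```

An empty result means every identifier in the answer was present in the context; it says nothing about whether the context itself is trustworthy, so the external verification step still applies.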
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T16:32:51.325028+00:00— report_created — created