Report #68747
[research] Generating plausible but non-existent academic citations or URLs
Implement strict citation grounding: only output URLs, DOIs, or paper titles extracted directly from retrieved documents, and never generate them from token probabilities. If a user requests a specific URL, confirm it with a verification tool call before returning it.
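A minimal sketch of what such a verification step might look like. The names `verify_url` and `head_status` are hypothetical, not part of any existing tool: `head_status` stands in for whatever tool call actually performs the request (for example a HEAD request made with the standard library's `urllib.request`) and is assumed to return an HTTP status code.

```python
from typing import Callable
from urllib.parse import urlparse


def verify_url(url: str, head_status: Callable[[str], int]) -> bool:
    """Return True only if the URL is well-formed and the tool reports success.

    `head_status` is a hypothetical, injected tool call that returns the
    HTTP status code for `url` (assumption: it raises OSError on network
    failure, as urllib-based helpers typically do).
    """
    parsed = urlparse(url)
    # Reject anything that is not an absolute http(s) URL before calling out.
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        return False
    try:
        status = head_status(url)
    except OSError:
        # Treat network errors as "not verified" rather than guessing.
        return False
    # 2xx and 3xx are treated as "the URL exists".
    return 200 <= status < 400
```

A passing check only shows that the URL resolves, not that the page supports the claim it is cited for, so this is a complement to grounding in retrieved documents rather than a replacement.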
Journey Context:
LLMs are trained to predict plausible tokens, so they produce realistic-looking but fake URLs and paper titles. Relying on the model to 'remember' URLs does not work: for specific strings the failure rate is near 100%. The only reliable fix is to treat citations as an extraction task over the RAG context, not a generation task. If a user asks for a source and retrieval returns nothing, the model must output 'I don't know' rather than guess.
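The extraction approach described above can be sketched roughly as follows. This is an illustrative outline under stated assumptions, not a definitive implementation: the function names, the regular expressions, and the example inputs are all hypothetical, and real URL or DOI matching may need stricter patterns.

```python
import re

# Assumed, deliberately simple patterns for illustration only.
URL_RE = re.compile(r"https?://[^\s<>\"')\]]+")
DOI_RE = re.compile(r"\b10\.\d{4,9}/[^\s\"<>]+")

TRAILING_PUNCTUATION = ".,;:"


def extract_citations(documents):
    """Collect every URL and DOI that appears verbatim in the retrieved text."""
    found = set()
    for text in documents:
        for pattern in (URL_RE, DOI_RE):
            for match in pattern.findall(text):
                # Drop sentence punctuation that the pattern swallowed.
                found.add(match.rstrip(TRAILING_PUNCTUATION))
    return found


def answer_with_sources(candidates, documents):
    """Keep only candidate citations grounded in the retrieved documents.

    `candidates` are the citations the model wants to emit. Anything not
    found in `documents` is dropped; if nothing survives, refuse.
    """
    allowed = extract_citations(documents)
    cleaned = [c.rstrip(TRAILING_PUNCTUATION) for c in candidates]
    kept = [c for c in cleaned if c in allowed]
    if not kept:
        return "I don't know"
    return "Sources: " + ", ".join(kept)
```

For example, with one retrieved document containing `https://example.org/paper-a`, a candidate list of that URL plus an invented one yields only the grounded URL, and an empty retrieval result yields 'I don't know'.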
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T21:52:40.445123+00:00— report_created — created