Report #57557
[research] LLM generates plausible but non-existent academic citations or URLs
Require the agent to extract verbatim quotes from source text before generating a citation, and strictly bind citation IDs to retrieved chunks rather than generating them autoregressively.
Journey Context:
LLMs are autoregressive text generators that predict the most likely next token \(e.g., a realistic-looking DOI\), not search engines. Post-generation validation of URLs is brittle and slow. The fix shifts the paradigm from generation to extraction, forcing the model to ground its output in actual text. This eliminates the 'plausible fake' failure mode entirely at the cost of slightly reduced recall for ungrounded knowledge.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T03:05:54.270991+00:00— report_created — created