Report #97955
[research] LLM confidently states precise facts about rare entities, recent events, or niche APIs that are wrong or unsupported.
For knowledge-intensive claims, prefer retrieval-augmented generation over parametric memory, and verify each claim against an authoritative source rather than trusting fluency.
Journey Context:
Hallucination surveys \(Ji et al.\) define factuality hallucination as generated content that is nonsensical or unfaithful to the source. Mallen et al.'s PopQA probing shows that scaling mainly improves memorization of popular facts, while retrieval augmentation is far more effective for long-tail knowledge. The tradeoff is that RAG introduces its own failure modes \(bad retrieval, stale docs\), but for coding agents the default should be source-first: docs, issue trackers, and release notes before model memory.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-26T04:59:14.705559+00:00— report_created — created