Report #70272
[counterintuitive] Hallucinations are a software bug that can be patched out with better RLHF or prompting
Treat hallucination as a feature of probabilistic generation, not a bug. Design systems with verification layers, retrieval augmentation, and human-in-the-loop review rather than expecting a 'hallucination-free' model.
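A minimal sketch of what such a layered design could look like, assuming a toy in-memory corpus and a caller-supplied `generate` function standing in for the model. Every name here (`retrieve`, `is_supported`, `answer_with_checks`, `Answer`) is hypothetical and chosen for illustration only; the word-overlap retrieval and substring check are deliberately naive placeholders for a real retriever and verifier.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Answer:
    text: str
    supported: bool     # True if the draft was found in a retrieved passage
    needs_review: bool  # True if the draft should be routed to a human


def retrieve(query: str, corpus: List[str]) -> List[str]:
    """Toy retrieval (assumption): keep passages sharing any word with the query."""
    terms = set(query.lower().split())
    return [p for p in corpus if terms & set(p.lower().split())]


def is_supported(claim: str, passages: List[str]) -> bool:
    """Toy verification (assumption): the claim must appear, case-insensitively,
    as a substring of at least one retrieved passage."""
    return any(claim.lower() in p.lower() for p in passages)


def answer_with_checks(
    query: str,
    generate: Callable[[str, List[str]], str],
    corpus: List[str],
) -> Answer:
    passages = retrieve(query, corpus)   # retrieval augmentation
    draft = generate(query, passages)    # model call (stubbed by the caller)
    ok = is_supported(draft, passages)   # verification layer
    # Unsupported drafts are flagged for human review instead of being trusted.
    return Answer(text=draft, supported=ok, needs_review=not ok)
```

The point of the design is that the model's output is never trusted on its own: an unsupported draft is still returned, but it carries a flag so that a person or a downstream check can decide what to do with it.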
Journey Context:
The prevailing belief is that hallucinations are errors to be eliminated. However, LLMs are trained to maximize likelihood over text distributions. When a model is prompted for a fact it does not know, the most probable sequence is a plausible-sounding fabrication, not 'I don't know' (which is low probability in internet text). RLHF helps, but it cannot fundamentally alter the generative architecture. Hallucination is the flip side of the model's core capability (creative synthesis): you cannot remove one without crippling the other.
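The likelihood argument above can be illustrated with a toy distribution. The probabilities below are invented for illustration and are not measured from any model; `greedy_pick` and `abstain_if_uncertain` are hypothetical helper names. The sketch shows that likelihood-maximizing decoding selects a specific, confident-sounding answer even when no single answer holds most of the probability mass, and that abstention has to be added by the surrounding system.

```python
from typing import Dict

# Hypothetical probabilities over complete answers to a question about an
# obscure fact. These values are made up for the example.
candidates: Dict[str, float] = {
    "It was founded in 1887.": 0.40,  # plausible-sounding, may be fabricated
    "It was founded in 1892.": 0.35,
    "It was founded in 1901.": 0.20,
    "I don't know.": 0.05,            # rare in the training text
}


def greedy_pick(dist: Dict[str, float]) -> str:
    """Return the highest-probability answer, as likelihood maximization would."""
    return max(dist, key=dist.get)


def abstain_if_uncertain(dist: Dict[str, float], threshold: float) -> str:
    """System-level guard (assumption): abstain when the top answer's
    probability falls below the given threshold."""
    best = greedy_pick(dist)
    return best if dist[best] >= threshold else "I don't know."


print(greedy_pick(candidates))
print(abstain_if_uncertain(candidates, threshold=0.5))
```

With these made-up numbers the unguarded pick is a specific year, although 60% of the mass sits on other answers. The guard changes the system's behavior without changing the underlying distribution, which is the sense in which the fix lives outside the model.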
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T00:32:08.095154+00:00— report_created — created