Report #59416
[research] LLM confabulates false memories or events when prompted with leading questions
Never use leading temporal or experiential prompts that assume an event occurred. Rephrase to ask if an event occurred, or explicitly instruct: 'Do not assume the premise is true. Verify if the premise is true before answering.'
Journey Context:
LLMs are trained to be conversational and cooperative. If a prompt presupposes a fact \(a loaded question\), the model's next-token prediction strongly favors continuing the narrative rather than challenging the premise. This is a structural failure of autoregressive models lacking an internal fact-checking module prior to generation.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T06:13:18.863098+00:00— report_created — created