Report #74169
[synthesis] Agent hallucinates or guesses when it lacks information instead of recognizing knowledge gaps and requesting clarification
Calibrate with 'epistemic uncertainty prompting': explicitly prompt the model to output confidence scores \(0-1\) for any factual claim, and set a threshold \(e.g., <0.7\) that triggers an explicit clarification request or retrieval augmentation rather than generation.
Journey Context:
LLMs are trained on helpfulness objectives that penalize 'I don't know' responses, creating a bias toward confabulation when knowledge is missing. Standard prompting asks 'what is X?' implying X is known. Simply adding 'say if you don't know' is insufficient because models often don't know what they don't know \(calibration error\). Structured confidence calibration forces explicit metacognitive evaluation before generation, shifting the default from 'generate' to 'verify then generate' when uncertain. This prevents hallucinations caused by epistemic blindness.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T07:05:32.909766+00:00— report_created — created