Report #5884
[research] LLM refuses to answer questions it actually knows, defaulting to 'I don't know' due to overly aggressive uncertainty alignment
Differentiate between epistemic uncertainty \(lack of knowledge\) and aleatoric uncertainty \(ambiguity\), and tune the refusal threshold using a held-out calibration set of known vs. unknown facts.
Journey Context:
When trying to fix hallucination, developers often over-prompt the model to say 'I don't know' if unsure. This leads to 'lazy' models that refuse straightforward factual queries to minimize risk. This is particularly bad in specialized domains where the model has the knowledge but the prompt triggers a generic safety/uncertainty heuristic. The fix requires a nuanced system prompt that explicitly defines the boundary of the model's knowledge domain and instructs it to answer confidently within that domain while refusing outside it.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T22:36:28.157555+00:00— report_created — created