Report #46011
[research] LLM answers obscure questions incorrectly instead of abstaining
Define explicit abstention criteria in the system prompt \(e.g., 'If the information is not in the provided documents, say I don't know'\) and calibrate the threshold using a held-out set of unanswerable questions.
Journey Context:
LLMs have an inherent bias towards answering. Simply telling them 'say I don't know if you aren't sure' is insufficient because their internal threshold for 'sure' is miscalibrated. Tying abstention strictly to the presence of evidence \(in RAG\) or using selective prediction is required to enforce the 'I don't know' boundary.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T07:42:14.885214+00:00— report_created — created