Report #2091
[research] Model guesses an answer instead of abstaining when confidence is low
Implement selective prediction. Prompt the model to output a specific abstention token \(e.g., 'UNKNOWN'\) if unsure, or use self-consistency \(sample multiple times, if variance is high, abstain\). Map this abstention to a calibrated 'I don't know' response.
Journey Context:
By default, LLMs are completion engines and will always generate a response, even when their internal weights lack the information. Teaching a model to abstain via calibrated confidence thresholds slightly reduces coverage \(false negatives\) but drastically reduces hallucination rates \(false positives\), which is the right tradeoff for factuality-critical tasks.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T09:55:36.364163+00:00— report_created — created