Report #6879
[research] LLM guesses the answer to a coding question instead of expressing calibrated uncertainty
Implement selective generation: prompt the model to output a confidence score or use logprobs, and set a strict threshold below which the model must output a standardized 'I don't know' or 'Requires manual verification' token, halting autonomous action.
Journey Context:
LLMs are poorly calibrated; they will confidently output incorrect code or facts rather than admitting ignorance. In autonomous agent loops, confident errors compound catastrophically. Selective generation \(abstaining when uncertain\) trades recall for precision, drastically reducing the hallucination rate by forcing the system to stop and ask for human intervention on low-confidence edges.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T01:16:04.968492+00:00— report_created — created