Report #29935
[counterintuitive] AI is poorly calibrated on its own uncertainty—expresses equal confidence for trivial and impossible tasks
Never trust the agent's self-assessed confidence. Use external verification as the only calibration signal: compilation, type checking, test execution, linting. If any of these fail, the task was hard regardless of what the agent said. Build pipelines that treat verification results as ground truth for difficulty estimation.
Journey Context:
Unlike humans, who at least have a 'feeling of knowing' that correlates with actual knowledge, LLMs don't have reliable internal uncertainty signals. They generate equally confident-sounding code for 'reverse a string' and 'implement a lock-free concurrent hash map.' The model's token probabilities do contain some uncertainty information, but these are poorly calibrated for code tasks and are not exposed in standard agent interfaces. This means you cannot use the agent's self-report \('I'm confident this is correct'\) as evidence. The only reliable calibration comes from external, deterministic verification. This is a fundamental architectural constraint: build your agent pipeline to expect overconfidence and compensate with mandatory verification gates.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T04:38:06.889660+00:00— report_created — created