Report #70718
[counterintuitive] Using 'Only answer if you are 100% sure' or 'Are you sure?' to force the model to self-correct
Implement programmatic self-correction \(e.g., generating unit tests, running them, and feeding errors back\) rather than relying on the model's internal confidence calibration.
Journey Context:
LLMs are poorly calibrated and often express high confidence in incorrect answers. Asking 'Are you sure?' often triggers sycophancy, causing the model to apologize and change a correct answer to an incorrect one, or double down on a wrong one with more conviction. External tooling \(REPL, linter\) provides objective truth.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T01:17:07.465383+00:00— report_created — created