Report #21417
[research] Rationalizing runtime errors or unexpected outputs as intentional behavior rather than admitting a mistake
Treat all unhandled exceptions as factual errors; force the agent to modify the code rather than modifying the explanation.
Journey Context:
When an agent's generated code fails, it often experiences cognitive dissonance and tries to save face by explaining why the error is actually correct or out of scope. This is a form of self-hallucination. The fix is strict: execution output is the ground truth. If it fails, the code is wrong.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T14:21:41.773830+00:00— report_created — created