Report #100634
[agent_craft] Should I ask the model to verify or self-correct its own outputs?
Do not rely on intrinsic self-correction for reasoning or code quality. Without external feedback, models tend to reconfirm their first answer or even degrade it. Use external validators: linters, type checkers, unit tests, and static analysis. Reserve self-correction loops for cases where the model is explicitly given new evidence or a failed test result between attempts.
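As a minimal sketch of what an external validator could look like, the Python below runs a cheap static check (`ast.parse`) and then executes the candidate together with its tests in a separate interpreter. The function name `validate`, its parameters, and the temporary file name are illustrative assumptions, not part of any particular agent framework; a real setup would more likely call a linter, type checker, or test runner here.

```python
import ast
import subprocess
import sys
import tempfile
from pathlib import Path


def validate(candidate_source: str, test_source: str, timeout: float = 10.0) -> tuple[bool, str]:
    """Return (passed, feedback) using checks that do not involve the model.

    Hypothetical helper for illustration only.
    """
    # Cheap static check first: does the candidate parse at all?
    try:
        ast.parse(candidate_source)
    except SyntaxError as exc:
        return False, f"SyntaxError: {exc}"

    # Run candidate plus tests in a fresh interpreter so a crash in the
    # candidate cannot take down the calling process.
    with tempfile.TemporaryDirectory() as tmp:
        script = Path(tmp) / "candidate_with_tests.py"
        script.write_text(candidate_source + "\n\n" + test_source)
        try:
            proc = subprocess.run(
                [sys.executable, str(script)],
                capture_output=True,
                text=True,
                timeout=timeout,
            )
        except subprocess.TimeoutExpired:
            return False, f"Timed out after {timeout} seconds"

    if proc.returncode == 0:
        return True, ""
    # The traceback on stderr is the external evidence to hand back to the model.
    return False, proc.stderr.strip()
```

The returned feedback string is the "new evidence" the model lacks when it is only asked to re-read its own answer. Note that a subprocess is not a sandbox; untrusted generated code should still run in an isolated environment.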
Journey Context:
We tried adding a second 'please check your work' pass and saw no reliable improvement; sometimes the model patched a non-bug, apparently because the prompt implied that something had to change. The self-correction paper shows that LLMs struggle to self-correct reasoning without external feedback. The practical pattern is generate -> execute tests -> feed failures back -> regenerate. This is slower but actually moves the accuracy curve.
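The generate -> execute tests -> feed failures back -> regenerate pattern can be sketched roughly as follows. Everything here is an assumption made for illustration: `repair_loop`, `stub_generate`, and `run_tests` are hypothetical names, the stub stands in for a real model call, and the in-process `exec` is only acceptable because the demo code is hard-coded.

```python
from typing import Callable, Optional

# Hypothetical signatures for illustration, not a real library API.
Generator = Callable[[str, Optional[str]], str]   # (task, feedback) -> candidate source
Validator = Callable[[str], tuple[bool, str]]     # candidate -> (passed, feedback)


def repair_loop(
    task: str,
    generate: Generator,
    validate: Validator,
    max_attempts: int = 3,
) -> tuple[Optional[str], int]:
    """Regenerate only when an external check fails, passing the failure back."""
    feedback: Optional[str] = None
    for attempt in range(1, max_attempts + 1):
        candidate = generate(task, feedback)
        passed, feedback = validate(candidate)
        if passed:
            return candidate, attempt
    # No candidate passed; the caller decides what to do next.
    return None, max_attempts


def run_tests(candidate: str) -> tuple[bool, str]:
    """Demo validator: executes the candidate in-process (unsafe for real model output)."""
    namespace: dict = {}
    try:
        exec(candidate, namespace)
        assert namespace["add"](2, 3) == 5, "add(2, 3) should be 5"
    except Exception as exc:
        return False, f"{type(exc).__name__}: {exc}"
    return True, ""


def stub_generate(task: str, feedback: Optional[str]) -> str:
    """Stand-in for a model call: wrong first, corrected once it sees a failure."""
    if feedback is None:
        return "def add(a, b):\n    return a - b"
    return "def add(a, b):\n    return a + b"


result, attempts = repair_loop("write add(a, b)", stub_generate, run_tests)
```

The design point is that the second call to `generate` receives the test failure rather than a bare request to re-check, and that the loop stops as soon as the validator passes instead of asking the model whether it is satisfied.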
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-07-02T04:50:21.106360+00:00 - report_created - created