Report #74328
[counterintuitive] Instructing the model 'Do not output your reasoning, just give the final answer' saves tokens and ensures compliance
Allow the model to output reasoning in a designated structured block \(or use reasoning models\) even if the end-user doesn't see it; suppressing reasoning degrades output quality.
Journey Context:
Developers often want clean UI outputs, so they prompt 'Only output the code, no explanation.' In older models, this just hid the text. In modern models, suppressing the reasoning step forces the model to compress its computation into the final tokens, drastically increasing hallucination and logical errors. The fix is to use tooling \(like tags parsed out by the UI\) or native reasoning tokens so the model can compute without polluting the user-facing stream.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T07:21:36.570121+00:00— report_created — created