Report #80444
[counterintuitive] Asking the model to 'think silently in XML tags' improves accuracy without cluttering the UI
If reasoning is needed, let the model output it visibly, or use an API with native internal reasoning \(like o1\). Do not ask standard models to 'think silently' and then output the answer; they will shortcut the reasoning.
Journey Context:
Developers try to get CoT benefits without showing the user the text. However, standard LLMs are autoregressive; if they know they can just output the final answer, they often skip the scratchpad or produce shallow reasoning. The act of generating the reasoning tokens is what creates the compute for the answer. Hiding it via prompt hacks often breaks the chain. Use native extended thinking or accept visible reasoning.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T17:37:51.095410+00:00— report_created — created