Report #70480
[counterintuitive] Asking the model to 'think inside tags' saves tokens and improves reasoning
Allow the model to reason visibly, or use native tool-use for computation; do not rely on hidden CoT in a single generation pass for complex logic.
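One way to read "native tool-use for computation" is that the model emits a tool call with an arithmetic expression and the application evaluates it, rather than the model working the sum out in hidden text. The sketch below shows only the application-side handler. The name calculator_tool and the idea that the model passes a plain expression string are assumptions made for illustration, not part of any specific provider's API.

```python
import ast
import operator

# Whitelisted arithmetic operators; any other syntax is rejected.
_BIN_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
}
_UNARY_OPS = {ast.USub: operator.neg, ast.UAdd: operator.pos}


def calculator_tool(expression: str) -> "int | float":
    """Hypothetical tool handler: evaluate plain arithmetic and nothing else."""

    def _eval(node):
        # Numeric literals only (bool is excluded even though it subclasses int).
        if (
            isinstance(node, ast.Constant)
            and isinstance(node.value, (int, float))
            and not isinstance(node.value, bool)
        ):
            return node.value
        if isinstance(node, ast.BinOp) and type(node.op) in _BIN_OPS:
            return _BIN_OPS[type(node.op)](_eval(node.left), _eval(node.right))
        if isinstance(node, ast.UnaryOp) and type(node.op) in _UNARY_OPS:
            return _UNARY_OPS[type(node.op)](_eval(node.operand))
        # Names, calls, attribute access, etc. are refused.
        raise ValueError("unsupported expression")

    return _eval(ast.parse(expression, mode="eval").body)
```

The handler walks the parsed syntax tree instead of calling eval, so a model-supplied string cannot run arbitrary code.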
Journey Context:
Developers tried to get the benefits of chain-of-thought (CoT) reasoning without the token cost by asking the model to think inside tags that are hidden from the user. Modern RLHF-trained models often ignore such hidden scratchpads, or produce only shallow reasoning when they know the user won't see it. If reasoning is needed, it must be part of the actual generation flow. If token cost is the concern, use a smaller model or a separate structured planning phase rather than trying to hack hidden reasoning into a single prompt.
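A structured planning phase could look roughly like the two-pass flow below: the first call asks only for a visible plan, and the second call receives that plan and produces the answer. This is a minimal sketch, not a definitive implementation. The function plan_then_answer, the call_model parameter, and the fake_model stand-in are all hypothetical names; a real client would replace fake_model.

```python
from typing import Callable, Dict

# Assumed interface: takes a prompt string and returns the model's text.
ModelFn = Callable[[str], str]


def plan_then_answer(question: str, call_model: ModelFn) -> Dict[str, str]:
    """Two-pass flow: request a visible plan, then an answer conditioned on it."""
    plan_prompt = (
        "List the steps needed to answer the question. "
        "Do not give the final answer yet.\n\n"
        f"Question: {question}"
    )
    plan = call_model(plan_prompt)

    # The plan is fed back as ordinary input, so the reasoning is part of
    # the actual generation flow rather than a hidden scratchpad.
    answer_prompt = (
        f"Question: {question}\n\n"
        f"Plan:\n{plan}\n\n"
        "Follow the plan and give the final answer."
    )
    answer = call_model(answer_prompt)
    return {"plan": plan, "answer": answer}


def fake_model(prompt: str) -> str:
    """Stand-in for a real model client so the sketch runs offline."""
    if "Do not give the final answer yet" in prompt:
        return "1. Multiply 17 by 3.\n2. Add 4."
    return "55"


result = plan_then_answer("What is 17 * 3 + 4?", fake_model)
print(result["plan"])
print(result["answer"])
```

Because call_model is passed in, the planning pass can use a smaller, cheaper model than the answering pass if token cost matters.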
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T00:53:09.941443+00:00 - report_created - created