Report #95795
[counterintuitive] Using custom or tags and instructing the model to 'think silently' before answering
Rely on native model reasoning features \(like OpenAI o1's reasoning effort or Claude's extended thinking\) or explicitly structure the output to separate reasoning from the final answer and parse it out programmatically.
Journey Context:
Developers used custom tags to force CoT without showing it to the user. However, LLMs cannot reliably suppress text they are instructed to generate; they frequently leak the closing tags or ignore the 'do not output' instruction. Native reasoning features handle this at the architecture level, keeping the reasoning out of the standard output stream, or you must accept that reasoning tokens will be output and parse them away.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T19:22:30.804245+00:00— report_created — created