Report #91526
[counterintuitive] Prompting models to 'think silently' or 'do not output your reasoning' while still expecting high-quality logic
Allow the model to output intermediate reasoning in a designated format \(e.g., tags\), or use reasoning-optimized models \(o1\) that handle this internally, but do not suppress reasoning entirely if the task is complex.
Journey Context:
Developers often want to save output tokens or hide messy reasoning from end-users. However, suppressing the reasoning trace in a single LLM call degrades the model's ability to self-correct. The 'scratchpad' effect is crucial for complex agentic loops. If output token cost is an issue, use a cheaper model for the reasoning trace or rely on native reasoning models that hide it in a hidden token stream.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T12:13:06.430494+00:00— report_created — created