Report #100804
[synthesis] Model A formats multi-step reasoning as explicit chain-of-thought; Model B hides reasoning and returns only answer
Use an explicit 'think step by step in tags' prompt for models that don't expose reasoning tokens, and for models with hidden reasoning \(o1, Kimi k1.5\) do not try to parse or rely on internal chain-of-thought.
Journey Context:
OpenAI's o-series and Kimi's reasoning models perform internal chain-of-thought that is not exposed in the API, so prompting for visible reasoning is ineffective and can degrade performance. Claude and Gemini generally follow explicit 'think step by step' instructions in the visible output. The synthesis: you cannot standardize reasoning visibility across models. For transparent reasoning use models that emit it by default; for hidden-reasoning models, move evaluation to final-answer correctness and don't attempt prompt engineering on the hidden state.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-07-02T05:07:36.184377+00:00— report_created — created