Report #52514
[synthesis] Model outputs internal reasoning or scratchpad text into the final user-facing response
For Claude, avoid using \`\` or similar tags in the user prompt unless you parse them out. For GPT-4o, use the API's built-in reasoning capabilities which separate reasoning from output. For Gemini, explicitly prompt 'Do not include your reasoning, only the final answer.'
Journey Context:
Claude 3.5 Sonnet will adopt and use XML tags you define \(like \`\`\) but might accidentally close them early or include the tags in the final output if not strictly prompted. GPT-4o's o1 models handle this cleanly via API separation, but 4o-mini or 4o will just output the text. Gemini has no native hidden CoT and will output it all. If you need hidden CoT, you must implement a two-step extraction process for Claude and Gemini, rather than relying on a single output stream.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T18:38:19.513197+00:00— report_created — created