Report #100336
[synthesis] Internal chain-of-thought leaks into the final response and breaks parsing
For DeepSeek-R1 and Kimi k1.5, parse and strip the tag-delimited reasoning blocks (DeepSeek-R1 uses <think>...</think>) before displaying or parsing output. For Claude 3.7 Sonnet extended thinking, read the 'thinking' content block separately and return only the 'text' block. For OpenAI o1/o3, reasoning tokens are hidden by API design, so there is nothing to strip. In every case, custom frontends must not concatenate content blocks blindly.
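A minimal sketch of the tag-stripping step, assuming the reasoning is wrapped in <think>...</think> (the tag DeepSeek-R1 emits; the exact tag for Kimi k1.5 should be checked against its actual output). The function name strip_reasoning is hypothetical, and dropping everything after an unclosed tag is a design assumption for truncated output, not something this report specifies.

```python
import re

# Assumption: reasoning is delimited by <think>...</think>.
# DOTALL lets "." match newlines, since reasoning usually spans several lines.
THINK_BLOCK = re.compile(r"<think>.*?</think>", re.DOTALL)

# Assumption: if the output was cut off inside the reasoning, the closing tag
# is missing, so everything from the opening tag to the end is reasoning.
UNCLOSED_THINK = re.compile(r"<think>.*\Z", re.DOTALL)


def strip_reasoning(raw: str) -> str:
    """Return the model output with tag-delimited reasoning removed."""
    without_closed = THINK_BLOCK.sub("", raw)
    without_unclosed = UNCLOSED_THINK.sub("", without_closed)
    return without_unclosed.strip()
```

Run this before any JSON or structured parsing, so the parser only ever sees the final answer.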
Journey Context:
Reasoning models expose their chain-of-thought differently: DeepSeek and Kimi wrap it in tags, Claude exposes it as a separate content block, and OpenAI hides it entirely. Custom frontends that concatenate all content blocks or stream raw chunks leak reasoning. The fix is provider-specific parsing, not a universal 'do not show your work' instruction.
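For the separate-content-block case, one way to avoid blind concatenation is to filter by block type. This is a sketch only: it assumes the response content has already been converted to plain dicts carrying a 'type' key, with 'thinking' blocks holding reasoning and 'text' blocks holding the answer. The helper name visible_text is hypothetical.

```python
from typing import Any, Iterable, Mapping


def visible_text(blocks: Iterable[Mapping[str, Any]]) -> str:
    """Join only the 'text' blocks; every other block type is skipped."""
    return "".join(
        block.get("text", "")
        for block in blocks
        if block.get("type") == "text"
    )


# Example: the 'thinking' block is ignored, only the answer is returned.
example_blocks = [
    {"type": "thinking", "thinking": "The user wants a number..."},
    {"type": "text", "text": "42"},
]
answer = visible_text(example_blocks)
```

Allow-listing 'text' rather than deny-listing 'thinking' means any block type the frontend does not recognize is dropped instead of leaked.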
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-07-01T05:03:17.723102+00:00— report_created — created