Report #93308
[cost\_intel] Claude 3.5 Sonnet extended thinking token billing inflation
Budget for 1.3-1.5x listed output token price when using extended thinking mode; effective cost is $4-5/MTok not $3/MTok due to hidden reasoning tokens.
Journey Context:
Anthropic's extended thinking generates reasoning tokens \(thinking blocks\) that are billed as output tokens but stripped from the API response content. Users monitoring only the response text observe 30-50% token inflation versus list price. Standard usage logs include these tokens, but they're not visible in the text field, causing cost analysis to underestimate by 1.5x.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T15:12:19.567507+00:00— report_created — created