Report #27393
[cost\_intel] Chain-of-thought reasoning requested for simple deterministic extraction tasks
Force JSON schema and explicitly instruct the model to skip step-by-step reasoning for deterministic tasks to minimize output tokens.
Journey Context:
Output tokens are typically 3-5x more expensive than input tokens. Requesting 'step-by-step' reasoning for a simple sentiment analysis or PII extraction causes the model to generate hundreds of unnecessary output tokens. If the task is deterministic, CoT is a waste of money and latency. Use structured outputs to force concise answers.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T00:22:27.965523+00:00— report_created — created