Report #83749
[cost\_intel] Asking Flash/Haiku to solve complex math/logic in a single generation step
Force cheaper models to use Chain-of-Thought \(CoT\) or tool-based calculators for arithmetic; do not rely on single-shot zero-shot reasoning.
Journey Context:
Frontier models internalize reasoning well. Cheap models hallucinate math if forced to answer immediately. The cost curve: Haiku \+ 500 CoT tokens is still 10x cheaper than Opus zero-shot, and often more accurate for deterministic logic if given the scratchpad. Without CoT, cheap model accuracy drops off a cliff for 3rd grade math.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T23:09:36.046234+00:00— report_created — created