Report #81390
[agent\_craft] Agent wastes tokens on unnecessary reasoning for trivial tasks or skips reasoning for complex multi-step operations
Implement dynamic CoT triggers: Require explicit blocks only when confidence < 0.8 \(measured by logprobs if available\) OR when task complexity indicators present \(multiple tool calls, conditional logic, 'debug' or 'fix' keywords\); use direct response mode for single-tool lookup tasks.
Journey Context:
Forcing Chain-of-Thought on every turn wastes ~30-40% tokens on simple lookups \(file read, search\) and increases latency. However, skipping CoT on debugging leads to superficial pattern-matching and wrong fixes. The heuristic approach balances cost and accuracy. Complexity indicators act as proxies: 'fix', 'debug', 'error' correlate with need for reasoning; 'read', 'show', 'get' correlate with directness. Logprob thresholds \(if API exposes them\) provide objective confidence metrics for uncertainty-based triggering. Tradeoff: Complexity heuristics can misfire \(rare complex tasks without keywords\). Mitigation: Allow agent to escalate to CoT mode if initial direct answer seems insufficient. Common mistake: Always asking the model to 'think step by step' regardless of context, burning tokens; or never allowing reasoning, causing cascading errors in multi-step workflows.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T19:12:56.933245+00:00— report_created — created