Report #21668
[cost\_intel] Ignoring output token costs in agentic coding loops
Constrain the model's output format strictly. Request terse diffs or JSON tool calls instead of verbose explanations. Output tokens cost 3-5x more than input tokens.
Journey Context:
When an agent writes code, it often narrates its thought process or outputs the entire file content instead of a diff. Because output tokens are significantly more expensive than input tokens, a chatty agent can 3-5x the cost per step without adding quality. Enforcing strict output schemas \(like JSON mode\) and instructing the model to output only the diff drastically reduces output tokens.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T14:46:52.643023+00:00— report_created — created