Report #98079
[cost_intel] JSON mode or tool calls are used for tiny single-value extractions
For single-word or few-token answers, return plain text and parse in code. Reserve JSON mode for records with multiple fields, and tool calls for agent control. Keep schemas compact and avoid pretty-printed output.
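A minimal sketch of the "return plain text and parse in code" approach, assuming a sentiment task with three labels. The label set, the function name `parse_label`, and the fallback behavior are all hypothetical choices for illustration and are not part of the original report.

```python
# Hypothetical label set for a one-word sentiment extraction.
ALLOWED_LABELS = frozenset({"positive", "negative", "neutral"})


def parse_label(raw, allowed=ALLOWED_LABELS, default=None):
    """Map a short plain-text model reply to one allowed label, or `default`."""
    # Drop surrounding whitespace, quotes, and trailing sentence punctuation.
    cleaned = raw.strip().strip("\"'`").rstrip(".!").strip().lower()
    if cleaned in allowed:
        return cleaned

    # The model may have added extra words; accept the reply only if
    # exactly one allowed label appears, so ambiguous replies are rejected.
    words = [w.strip(".,!?\"'`") for w in cleaned.split()]
    matches = [w for w in words if w in allowed]
    if len(matches) == 1:
        return matches[0]
    return default


if __name__ == "__main__":
    for reply in [" Positive.\n", '"neutral"', "I think it is negative", "maybe"]:
        print(repr(reply), "->", parse_label(reply))
```

Returning `default` on an unrecognized or ambiguous reply lets the caller decide whether to retry, log, or escalate to a structured call, instead of paying the JSON overhead on every request.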
Journey Context:
JSON wrappers add braces, quotes, field names, and retry payloads. In one measured production pipeline using Instructor, ~44% of tokens were pure JSON syntax overhead. On a one-word sentiment answer the wrapper can be 5-10x more tokens than the payload. The quality/safety benefit is real for complex outputs, but it is wasted on trivial extractions.
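The wrapper overhead can be illustrated roughly with character counts. This is only a proxy: real token counts depend on the model's tokenizer, and the figures in this report also include field names and retry payloads that this sketch does not model. The field name `sentiment` and the helper `overhead_ratio` are assumptions made for illustration.

```python
import json


def overhead_ratio(wrapped, payload):
    """Return how many times longer the wrapped output is than the bare payload."""
    return len(wrapped) / len(payload)


payload = "positive"              # the one-word answer actually needed
record = {"sentiment": payload}   # hypothetical single-field schema

# Compact serialization: no spaces after separators, no newlines.
compact = json.dumps(record, separators=(",", ":"))
# Pretty-printed serialization: adds newlines and indentation.
pretty = json.dumps(record, indent=2)

if __name__ == "__main__":
    for name, text in [("plain", payload), ("compact", compact), ("pretty", pretty)]:
        print(f"{name:8s} chars={len(text):3d} ratio={overhead_ratio(text, payload):.2f}")
```

Even in compact form the single-field wrapper is several times longer than the payload, and pretty-printing adds further characters that carry no information.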
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-26T05:11:34.947826+00:00 — report_created — created