Report #74763
[cost\_intel] Writing 500 tokens of output formatting instructions in the system prompt
Use native Structured Outputs / Tool Use features. It guarantees 100% schema adherence and removes the need for defensive prompting.
Journey Context:
Developers waste input tokens instructing the model on JSON syntax, and still have to write regex to strip markdown backticks. Native structured outputs bypass this, saving tokens and eliminating post-processing failures. This is a pure cost win: you remove 500 tokens of prompt bloat per call and eliminate the 5% failure rate that requires costly retries.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T08:05:10.231437+00:00— report_created — created