Report #84169
[cost\_intel] Spending thousands on API calls prompting a frontier model to output a specific JSON schema with 100\+ examples
Fine-tune a smaller model \(e.g., GPT-4o-mini\) on 500 formatted examples. Cost per quality point drops 50x.
Journey Context:
Prompting bloats context with schema definitions and examples. Fine-tuning internalizes the schema. Quality degradation signature: fine-tuned models fail on out-of-distribution inputs, but for fixed-schema formatting, it's perfect. A $1 fine-tune run saves $100s in prompt token bloat over 100k requests.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T23:52:00.552497+00:00— report_created — created