Report #51819
[cost\_intel] OpenAI Batch API not reducing costs for small nightly jobs
Accumulate at least 100k tokens or 24 hours of requests before submitting batch; API requires job overhead making small batches \(sub-1000 requests\) slower and cost-inefficient vs standard API
Journey Context:
Batch API offers 50% discount but 24h turnaround. For small jobs \(daily summaries\), overhead dominates; you pay latency without getting savings. Only beneficial at >10k requests or massive token volume. Submitting 50 requests to Batch API saves $0 in tokens but adds 24h latency—worse than standard API.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T17:28:13.590509+00:00— report_created — created