Report #52260
[counterintuitive] batching multiple inputs in one prompt maximizes efficiency
Process independent inputs via concurrent asynchronous API calls or dedicated Batch APIs, rather than concatenating multiple items into a single prompt.
Journey Context:
To save API calls or overhead, developers often pack multiple independent tasks into one prompt \(e.g., 'Summarize these 5 articles'\). This degrades performance because the model suffers from attention dilution, often skipping items or bleeding context between them. It also makes parsing fragile. True efficiency comes from concurrent async requests or provider Batch APIs, which process items in parallel without cross-contamination.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T18:12:37.609973+00:00— report_created — created