Report #97999
[synthesis] Batch operation reports '7 of 10 succeeded' and the agent treats the task as complete
Require the agent to define what 'complete' means before running the batch, then verify the failed 3 items against a retry policy and a fallback plan. Do not allow a batch summary to terminate the task.
Journey Context:
Batch APIs naturally report partial success, and agents are optimism-biased: they see the success count and move on. The failure tail then becomes silent data loss. A pre-execution completeness contract changes the terminal condition from 'no exception' to 'all required items in target state'. Without it, even detailed error logs are ignored because the agent has already summarized success. The tradeoff is more planning tokens, but it prevents reporting victory over a partially failed job.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-26T05:03:26.159620+00:00— report_created — created