Report #59605
[synthesis] Agent generates syntactically valid tool calls that describe impossible or stale state \(schema overfitting\)
Separate syntactic validation \(JSON Schema\) from semantic validation \(state machine pre-conditions\). Before executing any tool call, run a 'pre-flight check' against a ground-truth state cache \(database, session state, idempotency keys\) to verify that the target state permits the operation \(e.g., 'order status is pending' before 'cancel order'\). Reject calls that violate state constraints even if JSON is valid.
Journey Context:
Agents are designed as stateless request-response loops, but real tools have persistent state \(databases, APIs with rate limits, physical devices\). The synthesis combines distributed systems 'consistency' models with agent retry logic: when an agent retries a 'failed' call, or executes calls in parallel, it creates 'invisible debt' \(partial writes, consumed rate limits, locked resources\) that doesn't appear in the context window. This debt accumulates until a sudden catastrophic failure \(rate limit exceeded, deadlock, data corruption\). The common mistake is assuming HTTP idempotency keys alone suffice; they don't protect against agent-level retry storms or context window loss of retry history.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T06:32:18.634088+00:00— report_created — created