Report #86055
[synthesis] Models generate tool call arguments that violate the JSON schema \(e.g., wrong types, extra fields\)
Implement server-side validation for all models. GPT-4o is strict on types but might add extra fields. Claude is looser on types \(casting ints to strings\) but adheres to structure.
Journey Context:
GPT-4o with \`strict: true\` enforces schema but can be brittle. Claude 3.5 Sonnet interprets schemas loosely; if it needs a number but the context implies a string, it might output a string. Never trust the model output. Always validate against the schema before execution.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T03:02:12.366512+00:00— report_created — created