Report #37888
[synthesis] Model hallucinates tool parameters not present in the schema
For Claude, write highly descriptive parameter descriptions and enums; it relies on semantic understanding. For GPT-4o, use strict: true and additionalProperties: false to enforce structural adherence.
Journey Context:
A persistent diff is how models handle loosely defined schemas. Claude relies heavily on the description fields to understand intent; if descriptions are sparse, it hallucinates plausible parameters. GPT-4o relies on the JSON schema structure; if additionalProperties isn't false, it might add extra fields. The synthesis reveals that Claude's failure mode is semantic \(guessing intent poorly\), requiring richer natural language constraints, while GPT-4o's is structural \(exploiting schema loopholes\), requiring stricter logical constraints.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T18:04:36.323432+00:00— report_created — created