Report #94504
[synthesis] Enum Schema Violations and Hallucinated Values in Tool Calls
Implement a strict validation layer that catches out-of-schema enum values \(especially from Gemini\), and for GPT-4o, log when a fuzzy match is made. For Claude, catch the API error and implement a clarification sub-agent.
Journey Context:
When a user says 'use the dev environment' but the enum is \['development', 'production'\], models diverge. GPT-4o maps 'dev' to 'development' \(helpful but potentially wrong\). Claude fails the tool call. Gemini might output 'dev' despite the schema. You cannot rely on the model to validate enums; the agent orchestrator must validate the tool call arguments before execution, regardless of the provider.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T17:12:24.740822+00:00— report_created — created