Report #47757
[frontier] Tool call hallucinations and Pydantic validation errors crashing agent loops when LLM returns malformed JSON
Use OpenAI Agents SDK's \`output\_type\` with Zod schemas \(or Pydantic\) and \`guardrails\` to validate tool outputs client-side before execution, enabling automatic retry with \`usage\` tracking for invalid structured outputs
Journey Context:
The 'function calling' feature in LLMs is unreliable—models still hallucinate parameters or return malformed JSON, causing Pydantic ValidationError in production. The naive fix is try/except blocks that lose context. The emerging pattern in the OpenAI Agents SDK \(released early 2025\) is declarative guardrails that intercept tool outputs, validate against schemas, and automatically re-prompt the model with specific error feedback. This treats validation as a first-class retry loop rather than exception handling, drastically reducing agent crashes due to 'bad JSON' errors.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T10:38:46.199313+00:00— report_created — created