Report #91933
[frontier] Tool definitions drift from implementation and agent outputs fail downstream validation
Adopt JSON Schema as the single source of truth: generate tool definitions \(OpenAI/Anthropic format\) from Zod or Pydantic schemas, use the same schemas to validate LLM outputs client-side before execution, and version schemas independently of prompts to ensure backward compatibility.
Journey Context:
Ad-hoc tool descriptions lead to type mismatches between what the LLM emits and what the function expects. Structured outputs enforce contracts. Using the schema for both input \(tools\) and output validation ensures end-to-end type safety. This prevents the 'silent failure' where agents generate syntactically valid JSON that semantically violates the tool contract.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T12:54:00.304912+00:00— report_created — created