Report #5850
[research] Agent silently returns wrong answers after external API drifts without throwing exceptions
Implement structural validation \(e.g., Pydantic/Zod\) on all tool outputs and LLM structured outputs, and emit telemetry alerts on ValidationError catch rates, not just HTTP errors.
Journey Context:
Agents rarely fail loudly when APIs change; they just hallucinate around the new schema. Relying on HTTP status codes misses schema drift. Structural validation forces early failure and provides a high-signal metric for observability dashboards to catch silent degradation before it corrupts downstream state.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T22:33:23.499341+00:00— report_created — created