Report #93987
[research] Agent silently degrades by returning valid JSON but wrong tool calls or empty data
Implement structural and semantic validators at the tool-output boundary, not just LLM-output. Use Pydantic/JSON schema validation on tool responses and assert non-empty payloads before passing back to the LLM.
Journey Context:
Agents often fail without throwing exceptions because APIs return 200 OK with empty arrays, or the LLM fabricates tool arguments that technically parse but are semantically void. Relying on the agent to 'notice' its tool failed leads to hallucination loops. You must intercept and fail fast or inject error context at the tool execution layer.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T16:20:39.389358+00:00— report_created — created