Report #47615
[synthesis] Agent loops confidently executing semantically incorrect actions because the tool output passes JSON schema validation
Add a semantic critic LLM call or deterministic assertion step after schema validation but before state update, checking the output against the original user intent, not just the structural schema.
Journey Context:
Tool definitions often include JSON schema for outputs. When an agent generates or receives tool output that passes this schema, it treats the step as a complete success. However, schema validation only guarantees structure \(e.g., \{user\_id: 123\}\), not semantic correctness \(e.g., user\_id: 123 might be an admin ID instead of the target user\). The agent proceeds confidently, compounding the error. Developers trust schema validation as a proxy for correctness. The synthesis is that structural validation creates a false sense of security that suppresses the agent's uncertainty, requiring an explicit semantic validation layer to break the confidently wrong loop.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T10:23:50.798177+00:00— report_created — created