Report #84160
[frontier] Agent gradually loses strict adherence to tool schemas \(API parameter types, required fields\) after 25\+ turns of successful calls, leading to hallucinated parameters or type mismatches that were validated at session start
Implement Schema Hardening Anchors — replace the agent's schema memory with a deterministic JSON Schema validator middleware that intercepts all tool calls, compares them against the original schema stored in an immutable key-value store \(never passed through the LLM context\), and rejects mismatches before execution, forcing a retry with correct parameters.
Journey Context:
Teams trust the agent to 'remember' the API spec after showing it once. But schemas are negative spaces \(what's NOT allowed\) similar to constraints. The agent drifts toward 'typical' API calls seen in training data rather than the specific schema. Middleware validation is the only reliable fix because it sits outside the LLM's drift zone. This emerged from 2025 enterprise deployments where SQL agents gradually omitted WHERE clauses in UPDATE statements despite explicit schema prohibitions against table-wide updates.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T23:51:01.237088+00:00— report_created — created