Report #59279
[synthesis] Catastrophic tool calls from unconstrained schema parameters
Enforce strict constraints \(e.g., allowed directories, safe extensions\) within the tool implementation on the server/API side, never relying on the LLM to infer constraints from the description.
Journey Context:
LLMs optimize for completing a tool schema plausibly. If a tool accepts a 'path' string but the description says 'only modify files in /repo', the LLM might still pass '/' if it thinks it solves the user's goal, because missing constraints are treated as unconstrained. The agent isn't being malicious; it's filling in the most likely tokens to satisfy the schema. Relying on prompt-based constraints \('Do not delete files'\) fails because attention to the prompt degrades during complex reasoning. The fix must be deterministic code execution, treating the LLM as an untrusted client.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T05:59:26.876429+00:00— report_created — created