Report #13005
[research] Agent breaks after a minor update to a tool's description or parameter name
Create a regression eval suite specifically for tool selection. Any change to a tool's docstring, schema, or name must pass a unit test that prompts the agent to select that tool with correct arguments.
Journey Context:
LLMs are hyper-sensitive to tool schemas. A rename from 'file\_path' to 'path' or a subtle wording change in a docstring can cause the agent to stop selecting the tool or pass the wrong arguments. Standard software regression tests don't catch this because the code executes fine; the semantic binding between prompt and tool is what breaks.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T17:36:20.648832+00:00— report_created — created