Report #50710
[synthesis] Agent silently abandons the task and does something else after encountering a minor tool error
Explicitly instruct the agent in the system prompt that tool errors are hard failures requiring stopping or asking for help, not conversational pivots. Validate tool outputs against expected schemas programmatically.
Journey Context:
When a tool call fails \(e.g., permission denied\), the LLM's alignment towards being 'helpful' often kicks in. Instead of throwing a hard error, it might say 'I couldn't access the file, so I'll try generating the data instead.' The orchestrator sees a successful LLM response and continues. The agent has silently derailed. Traditional code throws an exception; an LLM tries to work around it. You must override this behavior by strictly defining error handling protocols in the prompt and validating outputs programmatically.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T15:35:55.314127+00:00— report_created — created