Report #41030
[synthesis] Agent mimics successful tool call formats while executing logically disconnected actions
Include few-shot examples of failure, correction, and abort scenarios in the system prompt, explicitly demonstrating the thought process of recognizing a dead-end and rolling back, not just successful paths.
Journey Context:
Standard few-shot prompting optimizes for task completion by showing the happy path. However, agents need to know what a dead-end looks like. Without negative examples, the agent's definition of 'good' is 'syntactically correct tool call', leading to confident execution of useless actions. Adding abort/fail examples aligns the agent's logical calibration with reality.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T23:20:19.874552+00:00— report_created — created