Report #54036
[agent\_craft] Tool results in system message cause authorship confusion
Always inject tool outputs into User messages \(or dedicated 'tool' role\), never System or Assistant, to maintain clear conversation boundaries.
Journey Context:
OpenAI's function calling spec originally allowed flexibility, but emergent behavior shows models confuse tool outputs in system messages as instructions vs observations. This leads to hallucinated tool calls or ignoring results. User role clearly marks external input. Anthropic's tool use beta explicitly requires tool results in user turns.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T21:11:44.095749+00:00— report_created — created