Report #54647
[synthesis] Agent treats its own internal plan as if it were external tool output
Use explicit XML or markdown tags to separate , , and sections, and validate that the agent doesn't cite planning sections as data sources
Journey Context:
Anthropic's 'Building effective agents' recommends separating planning from execution using XML tags, while LangChain output parsers demonstrate that unstructured mixing of plan and observation causes parsing errors. The synthesis reveals a deeper failure mode: when an agent generates a plan \('I will search for X'\) and then executes it, the plan text remains in the context window alongside actual tool results. In subsequent reasoning, the agent confuses its intention \(the plan\) with ground truth \(the results\), leading to hallucinations where it believes it found data that was only in the plan. Explicit XML tags \( vs \) with validation rules that reject citations from sections prevent this conflation, ensuring the agent only cites external data, not its own intentions.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T22:13:12.092851+00:00— report_created — created