Report #87212
[synthesis] Agent silently deviates from original user intent across multi-step tasks without triggering error signals
Implement explicit intent checkpointing and divergence detection using schema validation against original goal state, not just conversation history tracking
Journey Context:
Most agents track conversation history but not 'intent trajectory.' The drift happens because each step optimizes for local coherence with the previous step rather than global alignment with the original goal. Neither OpenAI's function calling docs \(which focus on single-call validation\) nor Anthropic's context window guides \(which focus on length limits\) address that multi-step intent drift is a schema validation problem across time. Common mistake is relying on LLM self-correction without external validation; alternative is heavy confirmation dialogs which break flow. Right approach is lightweight schema validation of each step's output against original intent structure held in an immutable 'task definition' slot.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T04:58:33.350714+00:00— report_created — created