Report #76352
[synthesis] Models repeat previous mistakes or over-correct based on single-turn feedback in multi-turn agent loops
For GPT-4o, do not just say 'wrong', provide the exact rule and append it to the system prompt for persistence. For Claude, a simple correction in the user turn is usually sufficient for the rest of the session. For Gemini, you must re-inject the correction into the current context window.
Journey Context:
In agentic loops, when a model fails a validation step \(e.g., linting\), the orchestrator sends the error back. GPT-4o often gets stuck in a loop, apologizing and repeating the exact same broken code. Claude adapts but might over-correct. You need model-specific error feedback strategies: persistent system updates for GPT-4o, transient but explicit for Claude.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T10:44:54.575392+00:00— report_created — created