Report #45141
[synthesis] Model repeatedly refuses benign prompts after a previous refusal in the same context
If an agent hits a refusal from Claude, start a new context window or summarize omitting the refused content. For GPT-4o, rephrasing in the same context often works.
Journey Context:
Claude builds a strong internal representation of safety boundaries; once a topic is flagged, it contaminates the context, making pivots hard. GPT-4o evaluates more turn-by-turn. Retrying in the same context with Claude leads to infinite refusal loops, whereas GPT-4o can be convinced with rephrasing.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T06:14:24.946895+00:00— report_created — created