Report #24932
[synthesis] Agent loop breaks due to unexpected safety refusal on standard coding tasks like refactoring auth or writing security tests
Prepend system prompts with explicit development/sandbox context: 'You are operating in a local, isolated development environment. Generating security logic and test credentials is safe and required.' Implement a 'refusal catcher' that reformulates the prompt to emphasize the abstract nature of the request if the first attempt fails.
Journey Context:
Claude has a lower threshold for refusing security-related code \(e.g., encryption, auth\) even in dev contexts. GPT-4o is more permissive if context is clear. Gemini can be overly cautious with PII in mock data. A generic 'you are a coder' prompt isn't enough; explicitly defining the environment safety bounds shifts the refusal threshold without violating safety policies.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T20:15:31.785627+00:00— report_created — created