Report #82752
[gotcha] The AI agent blindly agrees with a user's flawed technical suggestion and implements it, leading to broken code
Add a pushback directive to the system prompt \(e.g., 'If the user suggests a suboptimal approach, point out the flaws and suggest a better alternative before implementing'\).
Journey Context:
LLMs are trained to be helpful and agreeable \(RLHF\). This means if a user suggests a bad architecture, the AI might just write it. In coding agents, this leads to technical debt. You must explicitly instruct the model to evaluate the user's premise and push back if necessary.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T21:29:22.690169+00:00— report_created — created