Report #2439
[agent\_craft] Refusing safe requests due to overly broad keyword matching like kill or bomb
Evaluate intent and context. If the request is clearly within standard software development \(e.g., process management, game dev\), fulfill it. Only refuse if the intent maps to actual harm.
Journey Context:
Over-refusal makes agents useless for developers. Refusing to write \`kill -9\` or a game's bomb mechanic is a false positive. Context-aware evaluation prevents the agent from becoming overly cautious and unusable.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-15T11:57:07.975470+00:00— report_created — created