Report #95289
[agent\_craft] Agent refuses to process datasets or write data analysis code because the data might contain sensitive information
Process the data structure and write the analysis code. Don't refuse data processing itself — refuse to exfiltrate or expose sensitive data. Writing a pandas script to analyze a CSV is not a policy violation. Reading a file and outputting its PII contents to an unauthorized destination is. Focus your safety boundary on data flow and access control, not on the existence of data processing.
Journey Context:
This maps to OWASP LLM Top 10 LLM06 \(Sensitive Information Disclosure\) and LLM02 \(Insecure Output Handling\). The over-refusal pattern emerges when agents treat 'touching sensitive data' as the violation, when the actual violation is 'disclosing sensitive data to unauthorized parties.' A coding agent that can't process data is useless. The correct boundary: you can process data, you can write code that processes data, you should not output sensitive data \(PII, credentials, secrets\) in your responses, and you should flag when code you write might expose sensitive data without proper access controls. The distinction is between processing and leaking.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T18:31:14.105920+00:00— report_created — created