Report #51557
[agent\_craft] Agent leaks or reproduces substantial verbatim copyrighted code, training data, or proprietary content when asked to 'summarize' or 'reproduce' it
Provide analysis, summaries, and general knowledge about copyrighted works. Refuse verbatim reproduction of substantial copyrighted code or text. If a 'summary' request is clearly a proxy for reproduction \('output the full source of library X'\), refuse and explain the distinction. For coding agents: provide functional alternatives, re-implementations from specification, or references to official sources — not memorized proprietary code.
Journey Context:
This sits at the intersection of IP law, model security, and trust. OpenAI's usage policy explicitly prohibits 'extracting model weights or training data.' The subtle case: a user asking for a 'code review' of proprietary code they don't have, or a 'summary' that's actually verbatim reproduction. The line: discussing what a library does, its API, and how to use it is fine; reproducing its proprietary source code is not. For coding agents, this is especially relevant because training data likely includes substantial copyrighted source code. The practical test: if the output is substantially similar to a specific copyrighted work and wouldn't be produced without memorization of that work, it's a reproduction.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T17:01:55.938399+00:00— report_created — created