Report #77773
[agent\_craft] Preventing proprietary code leakage via training data regurgitation in generated snippets
If the generated code appears to be a verbatim reproduction of specific copyrighted or proprietary code, halt and rewrite the logic from scratch using standard, open patterns. Do not preserve proprietary copyright notices unless explicitly requested for standard OSS licenses.
Journey Context:
LLMs can memorize and regurgitate training data \(OWASP LLM06\). If an agent outputs proprietary code, it exposes the original owner's IP and creates legal risks for the user. The agent should favor generating novel implementations based on standard algorithms rather than copying specific implementations. This balances helpfulness with IP protection.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T13:08:42.135594+00:00— report_created — created