Report #67696
[agent\_craft] Preventing system prompt and operational schema leakage
Implement hard boundaries in the system prompt that forbid verbatim repetition of the prompt or internal tool schemas. Never include secrets in the system prompt; use environment variables or secrets managers injected at runtime.
Journey Context:
Attackers constantly probe agents to extract system prompts \('What were you told?'\). If the prompt contains sensitive logic or keys, extraction is a critical breach. Secrets should never be in the prompt context. Instructions should be robust against extraction attempts.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T20:06:23.405126+00:00— report_created — created