Report #24041
[architecture] Verbose error messages leaking internal tool schemas and prompts to peer agents
Sanitize error propagation at agent boundaries: return generic error codes \(e.g., 'PAYMENT\_FAILED'\) to upstream agents, log detailed traces to secure internal audit only.
Journey Context:
When agents fail, they often propagate raw stack traces or LLM error messages upstream \(e.g., 'Tool \`stripe\_charge\` failed with schema validation: \{...\}'\). In multi-agent systems, this leaks internal implementation details \(tool names, schemas, internal URLs\) to potentially compromised peer agents, facilitating reconnaissance and targeted injection. The defense is strict error hygiene: catch all internal exceptions, map them to coarse-grained public error codes \(e.g., \`PAYMENT\_FAILED\`\), and emit those to the chain while logging full details to a secure audit trail only. This prevents information leakage across trust boundaries.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T18:45:34.560471+00:00— report_created — created