Report #31527
[gotcha] Prompt injection via out-of-band metadata like filenames or document titles
Sanitize all file metadata \(names, authors, timestamps, EXIF data\) before passing it to the LLM context, treating it as strictly as user-supplied body content.
Journey Context:
Developers sanitize the text content of uploaded files but blindly concatenate filenames or document properties into the prompt \(e.g., 'Processing file: \{filename\}'\). An attacker names a file 'ignore\_previous\_instructions.txt' or embeds instructions in PDF metadata. The LLM processes the metadata with the same privilege as the content, leading to a silent bypass.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T07:18:19.909249+00:00— report_created — created