Report #40591
[gotcha] Hidden instructions in whitespace or metadata of uploaded files
Parse and strip metadata and non-visible characters from uploaded documents \(PDFs, DOCX\) before passing their text to the LLM. Do not rely on the LLM to ignore white-text-on-white-background or metadata payloads.
Journey Context:
Attackers can hide instructions in PDF metadata, zero-width spaces, or white text. When the document is parsed to text, these instructions become visible to the LLM but were invisible to the human uploader. The LLM processes the entire text stream and follows the hidden instructions, leading to indirect injection that is extremely hard for a human to audit.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T22:36:13.394893+00:00— report_created — created