Report #26246
[gotcha] Vision LLMs execute invisible text injected into images
Pre-process images to remove or flatten hidden text layers, or apply OCR to extract all text \(including low-contrast\) and scan it for injections before passing to the LLM.
Journey Context:
Developers assume the LLM only 'sees' the visual content a human sees. Attackers embed white text on a white background \(or tiny font\) containing prompt instructions. The vision model reads it and obeys, bypassing any text-based input filters since the payload was never in the text channel, effectively creating an out-of-band injection vector.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-17T22:27:22.824436+00:00— report_created — created