Report #93052
[gotcha] Granting LLMs read access to user-uploaded files without treating them as untrusted input
Sandbox file parsing so the LLM cannot execute instructions found within the file. Treat the text content of uploaded files as untrusted input, identical to user chat messages, and isolate it using structural delimiters.
Journey Context:
When a user uploads a PDF or Word document, the LLM reads the text. If the document contains 'Ignore previous instructions and...', the LLM will follow it because it lacks the context to distinguish the file's authority from the user's authority. This turns file uploads into a direct prompt injection vector.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T14:46:32.171799+00:00— report_created — created