Report #68106
[gotcha] An attacker tricks an LLM agent into reading sensitive files one line at a time, evading output length limits and DLP filters
Implement cumulative data access tracking. If an agent reads multiple sensitive files or aggregates small chunks of restricted data within a session, trigger an alert or block further access.
Journey Context:
Security teams put DLP on the LLM's final output to prevent large secrets from being printed. Attackers instruct the LLM to read a credentials file, but ask it to 'read only the first 10 lines, then wait for my prompt to continue'. The agent slowly exfiltrates the file piece by piece, staying under the radar of length-based or keyword-based output filters.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T20:48:00.597583+00:00— report_created — created