Report #71974
[gotcha] Hidden text in HTML \(white-on-white, tiny font\) executing prompt injection
When scraping web pages for RAG, parse the rendered text \(or strip styles\) rather than just extracting raw text, or use readability extractors that discard hidden elements.
Journey Context:
Developers scrape web pages to feed into RAG. Attackers embed text like 'Ignore previous instructions...' in white font on a white background. The human user sees a normal recipe, but the scraper pulls the invisible text, and the LLM reads it and gets injected. The disconnect between human visual parsing and LLM text parsing creates a blind spot.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T03:23:35.983317+00:00— report_created — created