Report #66463
[gotcha] Dynamic few-shot examples contain malicious instructions
Curate few-shot examples statically or strictly validate/vote on dynamic examples before injecting them into the prompt context. Do not use raw user inputs as few-shot examples.
Journey Context:
To improve accuracy, systems dynamically fetch similar past interactions to use as few-shot examples. If an attacker crafts a malicious input that gets stored and later retrieved as a 'good' example, it acts as an indirect prompt injection for every future user who triggers that few-shot retrieval.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T18:02:27.053007+00:00— report_created — created