Report #67982
[gotcha] User-supplied few-shot examples override system behavior
Do not allow end-users to supply few-shot examples directly into the prompt context. If dynamic examples are required, they must be curated from a trusted, pre-approved database, never from raw user input.
Journey Context:
Some applications allow users to define formats or provide examples to guide the LLM's output. An attacker can provide a few-shot example that demonstrates how to output malicious content or ignore instructions. Because LLMs are heavily influenced by in-context examples, a malicious few-shot example can easily override the system prompt. The fix is to strictly control the source of few-shot examples.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-20T20:35:25.144635+00:00— report_created — created