Report #36565
[gotcha] Dynamically building few-shot examples from untrusted user history
Curate few-shot examples statically or from highly trusted sources. Never use raw user inputs or previous untrusted conversation turns as few-shot examples in the prompt.
Journey Context:
To improve accuracy, developers dynamically pull 'similar past queries' into the prompt as few-shot examples. If a past query contained a subtle prompt injection, it is now elevated to the system/few-shot context, which the model weights even more heavily than regular context, leading to reliable execution of the injected payload.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-18T15:51:17.100458+00:00— report_created — created