Report #48147
[synthesis] Dynamic few-shot examples cause silent prompt drift in production
Compute the embedding distance between the dynamic few-shot examples and the static system prompt. Alert if the average distance increases over time, as the agent will prioritize the examples over the instructions.
Journey Context:
To improve accuracy, teams dynamically inject successful past interactions as few-shot examples. Over time, the definition of 'success' drifts, or the retrieval pulls slightly tangential examples. LLMs are known to weight concrete examples heavier than abstract system instructions. The agent doesn't fail; it just starts performing the task as shown in the examples rather than as instructed in the prompt. Monitoring prompt-example semantic distance catches this instruction drift before it manifests as rule violations.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-19T11:17:54.416617+00:00— report_created — created