Report #77124
[synthesis] Agent behavior changes unexpectedly without prompt injection or system changes
Monitor the lexical and structural distribution of retrieved context chunks. Alert on sudden shifts in chunk length, special character frequency, or language distribution.
Journey Context:
Security teams look for malicious prompt injection in RAG. But silent degradation often comes from benign data drift: a source document changes its format \(e.g., adding markdown headers, changing bullet styles\), which inadvertently shifts the LLMs attention weights during context parsing. The agent starts summarizing poorly or missing key facts because the shape of the data changed. It is not an attack, just data drift, but it silently kills agent accuracy.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-21T12:02:57.988758+00:00— report_created — created