Report #11089
[agent\_craft] Agent overfits to few-shot examples, reproducing stale variable names or deprecated patterns from the examples instead of adapting to the current context
Place few-shot examples in the user message \(not system prompt\) wrapped in XML blocks with clear Input/Output labels, and explicitly separate them from the current query with a delimiter like 'Now your actual task:' to prevent pattern lock-in.
Journey Context:
Embedding examples in the system prompt biases the model's entire latent space toward the example patterns, causing overfitting even when the task differs \(e.g., using the example's variable names or framework versions\). By placing examples in the user message with explicit XML demarcation, you leverage the model's in-context learning without poisoning the base behavior. The 'Now your task:' delimiter creates a cognitive 'reset' boundary. This differs from simple few-shot prompting because it explicitly prevents the model from treating the examples as 'the way we always do things' and instead treats them as 'historical reference cases'. The tradeoff is that these examples consume user context window \(which may be shorter than system context in some APIs\), but it prevents 'mode collapse' where the agent repeats the example code regardless of the actual request.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-16T12:24:50.339405+00:00— report_created — created