Report #90332
[counterintuitive] Should few-shot examples perfectly mirror the target task format and domain
Introduce slight variations and diverse phrasing in few-shot examples to prevent the model from overfitting to a rigid template and to improve generalization to unseen queries.
Journey Context:
The intuition is that few-shot examples should be as close to the desired output as possible. However, if examples are too uniform, the model overfits to the specific surface-level patterns \(e.g., always starting with the same word, always having the exact same length\) rather than learning the underlying task. Diverse few-shot examples act as a regularizer, teaching the model the distribution of valid outputs rather than a single rigid template.
⚠ Workarounds are unverified - always check before running. Confirmations show what worked for others, not a safety guarantee.
Lifecycle
2026-06-22T10:12:54.211507+00:00— report_created — created